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Clinical specimens tSSM In rllnfr iLmjc mbioloov .ahnr^ , 

and b.^ C " ni T' ^ reC8iVed jn C,,niCa ' miCr0bi0, °9y ,ab ^tories are urine 
and blood samples. At the microbio.ogy laboratory of the Centre Hospitaller de 
Universe Lava. (CHUL), urine and blood account for approximately 55% < * 
5 of the specmens receded. respect^y (Tab.e 3). The remaining 15% of clinica 
specmens comprise various biological fluids including sputum, pus. cLbrospina fl l 

7ZT« T ° therS ^ * ,nfeCti0nS ° f ^ ^ * rl aC Z 
n the b ood^ 

In fact, all cl.mcal samples received in the clinical microbiology laboratory are 3d 
10 routinely f or the identiflcauon of bacteria and susceptibility tesling ^ 
cpnyentiongl pathogon iriontifir ntjn airom clinic. « P „ im ^ 
Urine specimens 

The search for pathogens in urine specimens is so preponderant in the routine 
^ofogyiahorafo^^taray^oftestsha^baandevetoped. How v ZoH 

none ,s streaked on piatas and inoohatad for , 8 . 2 4 „ 0ur5 . Cofontes are the olid 
to determme tha total number of colony forming unite (CPU, pe, liter of urine A 

CFIVL o, mora „ urine. However, infections with lesa than 10' CFLVL in urine are 
20 pes*, parlicularfy ,„ paUama wilh . ^ „ 

(Sta* and Maki, ,984. N. Engl J. Med. 3,1:560-564). Importendy. appro»^,ot 

(re. bacterc,! ooun, of less than ,0> CFU/L; Table 3). Urine specimens found pool, tol 
by cuhura are furi^r character^ using standard bloohemica, teas to 
bactena, parogen and are also tested forsusceptibBy ,„ anftiolics. The biochemical 
and S uscep„b,l,ty leallng normally rsquire 13-24 hours of incubalion 

Accurate and rapid urine screening malhoda for bacterial palhogena would 

^afoater^ntolfonofnagauvesp.cinranaandamoraefficiL.raLanT^ 
cane management of patients. S e«a, rapid identification methods (Uriacrcen~ 

rZT'h Tradt ™ DNA and *"•» ha - -mLZZL 

atandart bW c hemK a, methods, which ara based on chura o, me badaria, pamogens 
^ aa m at '^.r "* — «- - P» 

35 66^' : PeZ210 * 1992 ' J ' C "' n - MiCroWo, • 30:64 °- 

Blood specimens 

The blood specimens received in the microbiology laboratory are alwavs 

ct:; e tt rL for r ur v iood cu,ture systems ™> be -zzzz 

complete* automated. The BACTEC system (from Becton Dickinson) and the 
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BacTAIert system (from Organon Teknika Corporation) are the two most widely used 
automated blood culture systems. These systems incubate blood culture bottles under 
optimal conditions for bacterial growth. Bacterial growth is monitored continuously to 
detect early positives by using highly sensitive bacterial growth detectors. Once growth 

5 is detected, a Gram stain is performed directly from the blood culture and then used 
to inoculate nutrient agar plates. Subsequently, bacterial identification and 
susceptibility testing are carried out from isolated bacterial colonies with automated 
systems as described previously. The bottles are normally reported as negative rf no 
growth is detected after an incubation of 6 to 7 days. Normally, the vast majority of 

10 blood cultures are reported negative. For example, the percentage of negative blood 
cultures at the microbiology laboratory of the CHUL for the period February 1994- 
January 1995 was 93.1% {Table 3). 
Other clinical samples 

Upon receipt by the clinical microbiology laboratory, all body fluids other than 
blood and urine that are from normally sterile sites (i.e. cerebrospinal, synovial, pleural, 
pericardial and others) are processed for direct microscopic examination and 
subsequent culture. Again, most clinical samples are negative for culture (Table 3). 

Regarding clinical specimens which are not from sterile sites such as sputum 
or stool specimens, the laboratory diagnosis by culture is more problematic because 
of the contamination by the normal flora. The bacterial pathogens potentially 
associated with the infection are purified from the contaminants and then identified as 
described previously. Of course, the universal detection of bacteria would not be useful 
for the diagnosis of bacterial infections at these non sterile sites. On the other hand, 
DNA-based assays for species or genus detection and identification as well as for the 
detection of antibiotic resistance genes from these specimens would be very useful 
and would offer several advantages over classical identification and susceptibility 
testing methods. 

nMA-hased aatv r ™»h anv clinical specimens ...... 

£~ a ; T obvious n e e d for rapid and accurate diagnostic tests for bactenal 

detection and identification directly from clinical specimens. DNA-based technolog.es 
are rapid and accurate and offer a great potential to improve the diagnosis of infechous 
diseases (Persing ef a/., 1993. Diagnostic Molecular Microbiology: Principles and 
Applications, American Society for Microbiology, Washington, D.C.). The DNA probes 
and amplification primers which are objects of the present invention are applicable for 
35 bacterial or fungal detection and identification directly from any clinical specimens such 
as blood cultures, blood, urine, sputum, cerebrospinal fluid, pus and other type of 
specimens (Table 3). The DNA-based tests proposed in this invention are supenor in 
terms of both rapidity and accuracy to standard biochemical methods currently used 
for routine diagnosis from any clinical specimens in microbiology laboratones. Since 
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these tests are performed in around only one hour, they provide the clinicians with new 
d.agnostic tools which should contribute to increase the efficiency of therapies with 
antimicrobial agents. Clinical specimens from organisms other than humans (e g other 
primates, birds, plants, mammals, farm animals, livestock and others) may also be 
tested with these assays. 

A high percentage of culture na tive sperirr^n c 

Among all the clinical specimens received for routine diagnosis, approximately 
80% of urine specimens and even more (around 95%) for other types of clinical 
specimens are negative for the presence of bacterial pathogens (Table 3) It would 
also be desirable, in addition to identify bacteria at the species or genus level in a 
given specimen, to screen out the high proportion of negative clinical specimens with 
a test detecting the presence of any bacterium (i.e. universal bacterial detection). Such 
a screening test may be based on the DNA amplification by PCR of a highly conserved 
genetic target found in all bacteria. Specimens negative for bacteria would not be 
amplified by this assay. On the other hand, those that are positive for bacteria would 
give a positive amplification signal with this assay. 
Towards the development of raniri D NA-hasprl diagnostic 

A rapid diagnostic test should have a significant impact on the management of 
infections. DNA probe and DNA amplification technologies offer several advantages 
over conventional methods for the identification of pathogens and antibiotic resistance 
genes from clinical samples (Persing etal., 1993. Diagnostic Molecular Microbiology: 
Principles and Applications. American Society for Microbiology. Washington, D.C.; 
Ehrlich and Greenberg, 1994, PCR-based Diagnostics in Infectious Disease, Blackwell 
Scientific Publications, Boston. MA). There is no need for culture of the bacterial 
pathogens, hence the organisms can be detected directly from clinical samples, 
thereby reducing the time associated with the isolation and identification of pathogens.' 
Furthermore, DNA-based assays are more accurate for bacterial identification than 
currently used phenotypic identification systems which are based on biochemical tests. 
Commercially available DNA-based technologies are currently used in clinical 
microbiology laboratories, mainly for the detection and identification of fastidious 
bacterial pathogens such as Mycobacterium tuberculosis, Chlamydia trachomatis. 
Neisseria gonorrhoeae as well as for the detection of a variety of viruses (Podzorski 
and Persing, Molecular detection and identification of microorganisms, In : P. Murray 
etal., 1995. Manual of Clinical Microbiology, ASM press, Washington D.C.). There are 
also other commercially available DNA-based assays which are used for culture 
confirmation assays. 

Others have developed DNA-based tests for the detection and identification of 
bacterial pathogens which are objects of the present invention: Staphylococcus spp. 
(US patent application serial No. US 5 437 978), Neisseria spp. (US patent application 
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serial No. US 5 162 199 and European patent application serial No. EP 0 337 896 131) 
and Listeria monocytogenes (US patent applications serial Nos US 5 389 513 and US 
5 089 386). However, the diagnostic tests described in these patents are based either 
on rRNA genes or on genetic targets different from those described in the present 
5 invention. 

Although there are diagnostic kits or methods already used in clinical 
microbiology laboratories, there is still a need for an advantageous alternative to the 
conventional culture identification methods in order to improve the accuracy and the 
speed of the diagnosis of commonly encountered bacterial infections. Besides being 

1 0 much faster, DNA-based diagnostic tests are more accurate than standard biochemical 
tests presently used for diagnosis because the bacterial genotype (e.g. DNA level) is 
more stable than the bacterial phenotype (e.g. metabolic level). 

Knowledge of the genomic sequences of bacterial and fungal species 
continuously increases as testified by the number of sequences available from 

15 databases. From the sequences readily available from databases, there is no 
indication therefrom as to their potential for diagnostic purposes. For determining good 
candidates for diagnostic purposes, one could select sequences for DNA-based 
assays for (i) the species-specific detection and identification of commonly 
encountered bacterial or fungal pathogens, (ii) the genus-specific detection and 

20 identification of commonly encountered bacterial or fungal pathogens, (Hi) the universal 
detection of bacterial or fungal pathogens and/or (iv) the specific detection and 
identification of antibiotic resistance genes. All of the above types of DNA-based 
assays may be performed directly from any type of clinical specimens or from a 
microbial culture. 

25 In our co-pending U.S. (N.S. 08/526,840) and PCT (PCT/CA/95/00528) patent 

applications, we described DNA sequences suitable for (i) the species-specific 
detection and identification of 12 clinically important bacterial pathogens, (ii) the 
universal detection of bacteria, and (iii) the detection of 17 antibiotic resistance genes. 
This co-pending application described proprietary DNA sequences and DNA 

30 sequences selected from databases (in both cases, fragments of at least 1 00 base 
pairs), as well as oligonucleotide probes and amplification primers derived from these 
sequences. All the nucleic acid sequences described in this patent application enter 
the composition of diagnostic kits and methods capable of a) detecting the presence 
of bacteria, b) detecting specifically the presence of 12 bacterial species and 17 

35 antibiotic resistance genes. However, these methods and kits need to be improved, 
since the ideal kit and method should be capable of diagnosing close to 100% of 
microbial pathogens and antibiotic resistance genes. For example, infections caused 
by Enterococcus faecium have become a clinical problem because of its resistance to 
many antibiotics. Both the detection of these bacteria and the evaluation of their 
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resistance profiles are desirable. Besides that, novel DNA sequences (probes and 
primers) capable of recognizing the same and other microbial pathogens or the same 
and additional antibiotic resistance genes are also desirable to aim at detecting more 
target genes and complement our earlier patent application. 

STATEMENT OF THE INVENTinM 

It is an object of the present invention to provide a specific, ubiquitous and sensitive 
method using probes and/or amplification primers for determining the presence and/or 
amount of nucleic acids: 

- from specific microbial species or genera selected from the group consisting 
of Streptococcus species, Streptococcus agalactiae, Staphylococcus species, 
Staphylococcus saprophytics, Enterococcus species, Enterococcus faecium, 
Neisseria species, Neisseria meningitidis, Listeria monocytogenes, Candida species 
and Candida albicans 

- from an antibiotic resistance gene selected from the group consisting of 
bla^, bla^, bla^, blaZ, aadB, aacC1, aacC2, aacC3, aacA4, aac6'-lla, ermA, ermB, 
ermC, mecA, vanA, vanB, vanC, satA, aac(6 , yaph(2"), aad(&), vat, vga, msrA, su/and 
int, and optionally, 

- from any bacterial species 

in any sample suspected of containing said nucleic acids, 

wherein each of said nucleic acids or a variant or part thereof comprises a 
selected target region hybridizable with said probe or primers; 

said method comprising the steps of contacting said sample with said probes 
or primers and detecting the presence and/or amount of hybridized probes or amplified 
products as an indication of the presence and/or amount of said any bacterial species, 
specific microbial species or genus and antibiotic resistance gene. 

In a specific embodiment, a similar method directed to each specific microbial 
species or genus detection and identification, antibiotic resistance genes detection, 
and universal bacterial detection, separately, is provided. 

In a more specific embodiment, the method makes use of DNA fragments 
(proprietary fragments and fragments obtained from databases), selected for their 
capacity to sensitively, specifically and ubiquitously detect the targeted bacterial or 
fungal nucleic acids. 

In a particularly preferred embodiment, oligonucleotides of at least 12 
nucleotides in length have been derived from the longer DNA fragments, and are used 
in the present method as probes or amplification primers. 

The proprietary oligonucleotides (probes and primers) are also another object 
of the invention. 

Diagnostic kits comprising probes or amplification primers for the detection of 
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a microbial species or genus selected from the group consisting of Streptococcus 
species, Streptococcus agalactiae, Staphylococcus species, Staphylococcus 
saprophytics, Enterococcus species, Enterococcus faecium, Neisseria species, 
Neisseria meningitidis, Listeria monocytogenes, Candida species and Candida 
5 albicans are also objects of the present invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of an antibiotic resistance gene selected from the group consisting of bla ttm , 
bla^, bla^ bla^,, blaZ, aadB, aacC1, aacC2, aacC3, aacA4, aac6'-lla, ermA, ermB, 
ermC, mecA, vanA, vanB, vanC, satA, aac(6')-aph(2"), aad(6'), vat, vga, msrA, sul and 
1 0 int are also objects of this invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of any bacterial or fungal species, comprising or not comprising those for the 
detection of the specific microbial species or genus listed above, and further 
comprising or not comprising probes and primers for the antibiotic resistance genes 
1 5 listed above, are also objects of this invention. 

In a preferred embodiment, such a kit allows for the separate or the 
simultaneous detection and identification of the above-listed microbial species or 
genus, antibiotic resistance genes and for the detection of any bacterium. 

In the above methods and kits, amplification reactions may include a) 
20 polymerase chain reaction (PCR), b) ligase chain reaction, c) nucleic acid sequence- 
based amplification, d) self-sustained sequence replication, e) strand displacement 
amplification, f) branched DNA signal amplification, g) transcription-mediated 
amplification, h) cycling probe technology (CPT) i) nested PCR, or j) multiplex PCR. 
In a preferred embodiment, a PCR protocol is used as an amplification reaction. 
25 In a particularly preferred embodiment, a PCR protocol is provided, comprising, 

for each amplification cycle, an annealing step of 30 seconds at 45-55°C and a 
denaturation step of only one second at 95°C, without any time allowed specifically for 
the elongation step. This PCR protocol has been standardized to be suitable for PCR 
reactions with all selected primer pairs, which greatly facilitates the testing because 
30 each clinical sample can be tested with universal, species-specific, genus-specific and 
antibiotic resistance gene PCR primers under uniform cycling conditions. Furthermore, 
various combinations of primer pairs may be used in multiplex PCR assays. 

We aim at developing a rapid test or kit to discard rapidly all the samples which 
are negative for bacterial cells and to subsequently detect and identify the above 
35 bacterial and/or fungal species and genera and to determine rapidly the bacterial 
resistance to antibiotics. Although the sequences from the selected antibiotic 
resistance genes are available from databases and have been used to develop DNA- 
based tests for their detection, our approach is unique because it represents a major 
improvement over current gold standard diagnostic methods based on bact rial 
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cultures. Using an amplification method for the simultaneous bacterial detection and 
identification and antibiotic resistance genes detection, there is no need for culturing 
the clinical sample prior to testing. Moreover, a modified PCR protocol has been 
developed to detect all target DNA sequences in approximately one hour under 
uniform amplification conditions. This procedure will save lives by optimizing treatment, 
will diminish antibiotic resistance because less antibiotics will be prescribed, will reduce 
the use of broad spectrum antibiotics which are expensive, decrease overall health 
care costs by preventing or shortening hospitalizations, and decrease the time and 
costs associated with clinical laboratory testing. 

In the methods and kits described herein below, the oligonucleotide probes and 
amplification primers have been derived from larger sequences (i.e. DNA fragments 
of at least 100 base pairs). All DNA fragments have been obtained either from 
proprietary fragments or from databases. DNA fragments selected from databases are 
newly used in a method of detection according to the present invention, since they 
have been selected for their diagnostic potential. 

It is clear to the individual skilled in the art that other oligonucleotide sequences 
appropriate for (i) the universal bacterial detection, (ii) the detection and identification 
of the above microbial species or genus and (iii) the detection of antibiotic resistance 
genes other than those listed in Annex VI may also be derived from the proprietary 
fragments or selected database sequences. For example, the oligonucleotide primers 
or probes may be shorter or longer than the ones we have chosen; they may also be 
selected anywhere else in the proprietary DNA fragments or in the sequences selected 
from databases; they may be also variants of the same oligonucleotide. If the target 
DNA or a variant thereof hybridizes to a given oligonucleotide, or if the target DNA or 
a variant thereof can be amplified by a given oligonucleotide PCR primer pair, the 
converse is also true; a given target DNA may hybridize to a variant oligonucleotide 
probe or be amplified by a variant oligonucleotide PCR primer. Alternatively, the 
oligonucleotides may be designed from any DNA fragment sequences for use in 
amplification methods other than PCR. Consequently, the core of this invention is the 
identification of universal, species-specific, genus-specific and resistance gene-specific 
genomic or non-genomic DNA fragments which are used as a source of specific and 
ubiquitous oligonucleotide probes and/or amplification primers. Although the selection 
and evaluation of oligonucleotides suitable for diagnostic purposes requires much 
effort, it is quite possible for the individual skilled in the art to derive, from the selected 
DNA fragments, oligonucleotides other than the ones listed in Annex VI which are 
suitable for diagnostic purposes. When a proprietary fragment or a database sequence 
is selected for its specificity and ubiquity, it increases the probability that subsets 
thereof will also be specific and ubiquitous. 

Since a high percentage of clinical specimens are negative for bacteria (Table 
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3) DNA fragments having a high potential for the selection of universal oligonucleotide 
probes or primers were selected from proprietary and database sequences. The 
amplification primers were selected from a gene highly conserved in bacteria and 
fungi and are used to detect the presence of any bacterial pathogen in clinical 
specimens in order to determine rapidly (approximately one hour) whether it is positive 
or negative for bacteria. The selected gene, designated tuf, encodes a protein (EF-Tu) 
involved in the translational process during protein synthesis. The tuf gene sequence 
alignments used to derive the universal primers include both proprietary and database 
sequences (Example 1 and Annex I). This strategy allows the rapid screening of the 
numerous negative clinical specimens (around 80% of the specimens received, see 
Table 3) submitted for bacteriological testing. Tables 4. 5 and 6 provide a list of the 
bacterial or fungal species used to test the specificity of PCR primers and UNA probes 
Table 7 gives a brief description of each species-specific, genus-specific and umversal 
amplification assays which are objects of the present invention. Tables 8, 9 and 10 
> provide some relevant information about the proprietary and database sequences 
selected for diagnostic puposes. 

nPTAILED DF^C RIP TION OF THF INVENTION 

0 £ e sM an^_ fl er ^r r ifir PNA probes anri amplification P"mers fo r, 
mjrrnnrqanisms 

^^ jSnL ^ b 3se^ofs^prn- n.itnhlo for Hia nnosfc purposes 

In order to select sequences which are suitable for species-speafic or genus- 
specific detection and identification of bacteria or fungi or, alternatively, for the 
-5 universal detection of bacteria, the database sequences (GenBank. EMBL and Swiss- 
Prot) were chosen based on their potential for diagnostic purposes according o 
sequence information and computer analysis performed with these sequences. Inrtial* 
all sequence data available for the targeted microbial species or genus were carefu ly 
analyzed. The gene sequences which appeared the most promising for d,agnos tic 
30 purposes based on sequence information and on sequence comparisons w,th he 
^ corresponding gene in other microbial species or genera performed with the Genetics 
Computer Group (GCG, Wisconsin) programs were selected for testing by FCFL 
Optima. PCR amplification primers were chosen from the selected database 
sequences with the help of the Oligo~ 4.0 primer analysis software , (Nationa 
35 Biosciences Inc., Plymouth, Minn.). The chosen primers were tested in PCR assays 
" for their specificity and ubiquity for the target microbia. species or genus. In genera, 
the identification of database sequences from which amplification primers surtable for 
species-specific or genus-specrtic detection and identification were selected involved 
the computer analysis and PCR testing of several candidate gene sequences before 
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obtaining a primer pair which is specific and ubiquitous for the target microbial species 
or genus. Annex VI provides a list of selected specific and ubiquitous PGR primer 
pairs. Annexes I to V and Examples 1 to 4 illustrate the strategy used to select genus- 
spectfic, species-specific and universal PGR primers from fi/f sequences or from the 
recA gene. 

Oligonucleotide primers and nmh«. g H a , f qn anri * Yr ^ r 

, J*.* fra9mentS Sequenced b V us or elected from databases (GenBank 
and EMBL) were used as sources of oligonucleotides for diagnostic purposes For this 
sfrategy. an array of suitable oligonucleotide primers or probes derived from a variety 
of genomic DNA fragments (s*e of more than 100 bp) selected from databases were 
tested for their specificity and ubiquity in PGR and hybridization assays as described 
later. It is important to note that the database sequences were selected based on their 
potential for being species-specific, genus-specific or universal for the detection of 
bacteria or fungi according to available sequence information and extensive analysis 
and that, .n general, several candidate database sequences had to be tested in order 
to obtain the desired specificity, ubiquity and sensitivity. 

Oligonucleotide probes and amplification primers derived from species-specific 
fragments selected from database sequences were synthesized using an automated 
DNA synthesizer (Perkin-Elmer Corp., App.ied Biosystems Division). Prior to synthesis 
all oligonucleotides (probes for hybridization and primers for DNA amplification) were 
evaluated for their suitability for hybridization or DNA amplification by polymerase 
cham reaction (PGR) by computer analysis using standard programs (i.e. the Genetics 
Computer Group (GCG) programs and the primer analysis software Oligo™ 4 0) The 
potential suitability of the PGR primer pairs was also evaluated prior to the synthesis 
by verifying the absence of unwanted features such as long stretches of one nucleotide 
and a h,gh proportion of G or C residues at the 3" end (Persing et a,., 1993, Diagnostic 
Molecular Microbiology: Principles and Applications, American Society for 
Microbiology, Washington, D.C.). 

The oligonucleotide primers or probes may be derived from either strand of the 
duplex DNA. The primers or probes may consist of the bases A, G, C, or T or analogs 
and they may be degenerated at one or more chosen nucleotide position(s) The 

TtZ 7 Pr ° beS 56 ° f SUftab,e ' ength and <"* be se,ected ^here within 
the DNA sequences from proprietary fragments or from selected database sequences 
which are suitable for (i) the universa. detection of bacteria, (ii) the species-specific 
detection and identification of Enterococcus faecium. Listeria monocytogenes 
Ne t ssena meningitidis, Staphylococcus saprophyticus, Streptococcus agalactiae and 
Candida albicans (iii) the genus-specific detection of Streptococcus species, 
Enterococcus species, Sfapny/ococcus species and Neisseria species or (iv) the 
detection of the 26 above-mentioned clinically important antibiotic resistance genes 
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Variants for a given target bacterial gene are naturally occurring and are 
attributable to sequence variation within that gene during evolution (Watson et a/., 
1987 Molecular Biology of the Gene. 4* ed., The Benjamin/Cummings Publishing 
Company Menlo Park. CA; Lewin, 1989. Genes IV. John Wiley & Sons. New York. 
5 NY) For example, different strains of the same bacterial species may have a single or 
more nucleotide variation(s) at the oligonucleotide hybridization site. The person skilled 
in the art is well aware of the existence of variant bacterial or fungal DMA sequences 
for a specific gene and that the frequency of sequence variations depends on the 
selective pressure during evolution on a given gene product. The detection of a variant 
1 o sequence for a region between two PCR primers may be demonstrated by sequenang 
the amplification product. In order to show the presence of sequence variants at the 
primer hybridization site, one has to amplify a larger DNA target with PCR primers 
outside that hybridization site. Sequencing of this larger fragment will allow the 
detection of sequence variation at this site. A similar strategy may be applied to show 
15 variants at the hybridization site of a probe. Insofar as the divergence of the target 
sequences or a part thereof does not affect the specificity and ubiquity of the 
amplification primers or probes, variant bacterial DNA is under the scope of th,s 
invention. Variants of the selected primers or probes may also be used to amplrfy or 
hybridize to a variant DNA. 
20 s yncing of tuf^ mmcf from t vwty of burial and fungal specie* 

The nucleotide sequence of a portion of tuf genes was determined for a vanety 
of bacteria, and fungal species. The amplification primers SEQ ID NOs: 107 and 108. 
which amplify a tuf gene portion of approximately 890 bp. ^re used for he 
sequencing of bacterial tuf sequences. The amplification primers SEQ ID NOs. 109 
25 and 172. which amplify a tuf gene portion of approximately 830 bp, were used for the 
sequencing of fungal tuf sequences. Both primer pairs can amplify tuf A and MB 
genes. This is not surprising because these two genes are nearly idenucal. For 
example, the entire MA and tufB genes from E. coli differ at only 13 nucleotide 
positions (Neidhardt ef a/.. 1996. Escherichia coli and Salmonella: Cellular and 
30 Molecular Biology. 2- ed.. American Society for Microbiology Press. Washmgton 
DC) These amplification primers are degenerated at several nucleotide positions and 
contain inosines in order to allow the amplification of a wide range of tuf sequences. 
The strategy used to select these amplification primers is similar to that .llustrated in 
Annex I for the selection of universal primers. The amplification primers SEQ ID NOs: 
35 107 and 108 could be used to amplify the tuf genes from any bacterial spec.es. The 
amplification primers SEQ ID NOs: 109 and 172 couid be used to amplify the tuf genes 

from any fungal species. 

The tuf genes were amplified directly from bacterial or yeast cultures us.ng the 
following amplification protocol: One fA. of cell suspension was transferred directly to 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



20 



25 



WO 98/20157 

PCT/CA97/00829 - 

-12- 

19 mL of a PCR reaction mixture containing 50 mM KCI, 10 mM Tris-HCI (pH 9 0) 
01% Triton X-100, 2.5 mM MgCI,, 1 of each of the 2 primers, 200 nM of each of 
the four dNTPs, 0.5 unit of Tag DNA polymerase (Promega Corp., Madison. Wl) PCR 
reactions were subjected to cycling using a MJ Research PTC-200 thermal cycler (MJ 
Research Inc.. Watertown, Mass.) as follows: 3 min at 96»C followed by 30-35 cycles 
of 1 m.n at 95°C for the denaturation step, 1 min at 30-50'C for the annealing step and 
1 mm at 72°C for the extension step. Subsequently, twenty microliters of the PCR- 
amplified m.xture were resolved by electrophoresis in a 1 .5% agarose gel. The gel was 
then v.sualized by staining with methylene blue (Flores et a/.. 1992. Biotechniques 
13:203-205). The size of the amplification products was estimated by comparison with 
a 100-b P molecular weight ladder. The band corresponding to the specific amplification 
product (i.e. approximately 890 or 830 bp for bacterial or fungal fc/f sequences 
respectwely) was excised from the agarose gel and purified using the QIAquick™ qel 
extraction kit (QIAGEN Inc.. Chatsworth, CA). The gel-purified DNA fragment was then 
used directly in the sequencing protocol. Both strands of the tuf genes amplification 
product were sequenced by the dideoxynucleotide chain termination sequencing 
method by using an Applied Biosystems automated DNA sequencer (model 373A) with 
the.r PRISM™ Sequenase* Terminator Double-stranded DNA Sequencing Kit (Perkin- 
Elmer Corp.. Applied Biosystems Division, Foster City, CA). The sequencing reactions 
were all performed by using the amplification primers (SEQ ID NOs: 107 to 109 and 
172) and 100 ng per reaction of the gel-purified amplicon. In oraer to ensure that the 
determmed sequence did not contain errors attributable to the sequencing of PCR 
artefacts, we have sequenced two preparations of the gel-purified tuf amplification 
product originating from two independent PCR amplifications. For all target microbial 
species, the sequences determined for both amplicon preparations were identical 
Furthermore, the sequences of both strands were 100% complementary thereby 
confirmmg the high accuracy of the determined sequence. The tuf sequences 
determmed using the above strategy are all in the Sequence Listing (i.e SEQ ID 
NOs:118 to 146). Table 13 gives the originating microbial species and the source for 
each tuf sequence in the Sequence Listing. 

The alignment of the tuf sequences determined by us or selected from 
databases reveals clearly that the length of the sequenced portion of the fcrf genes is 
variable. There may be insertions or deletions of several amino acids. This explains 
why the s.ze of the sequenced tuf amplification product was variable for both bacterial 
and fungal species. Among the fof sequences determined by our group we found 
mserfons and deletions adding up to 5 amino acids or 15 nucleotides. Consequently 
th nucleot.de positions indicated on top of each of Annexes I to V do not correspond 
for tuf sequences having insertions or deletions. 

It should also be noted that the various tuf sequences determined by us 
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occasionally contain degenerescences. These degenerated nucleotides correspond 
to sequence variations between tuf A and MB genes because the amplification primers 
amplify both tuf genes. These nucleotide variations were not attributable to nucleotide 
reincorporations by the taq DNA polymerase because the sequence of both strands 
5 were identical and also because the sequences determined with both preparations of 
the gel-purified tuf amplicons were identical, 
j jjg election a i ~rr' if "- at i n " primers from resequences 

The fof sequences determined by us or selected from databases were used to 
select PCR primers for (i) the universal detection of bacteria, (ii) the genus-specific 
10 detection and identification of Enterococcus spp. and Staphylococcus spp. and (iii) the 
species-specific detection and identification of CandWa albicans. The strategy used to 
select these PCR primers was based on the analysis of multiple sequence alignments 
of various tuf sequences. For more details about the selection of PCR primers from tuf 
sequences, please refer to Examples 1 to 3 and Annexes I to IV. 
15 Thft flection »f amplification nrimers from recA 

The comparison of the nucleotide sequence for the recA gene from various 
bacterial species including 5 species of streptococci allowed the selection of 
Sfrepfococcus-specific PCR primers. For more details about the selection of PCR 
primers from recA. please refer to Example 4 and Annex V. 
20 nuA foment I s olation, from Staphylococcus sapro p hytic,!* by arb.trar.ly 

primed PCR 

DNA sequences of unknown coding potential for the species-specific detecfion 
and identification of Staphylococcus saprophytics were obtained by the method of 
arbitrarily primed PCR (AP-PCR). 
25 AP-PCR is a method which can be used to generate specific DNA probes for 

microorganisms (Fani et a/., 1993, MoL Ecol. 2:243-250). A description of the AP-PCR 
protocol used to isolate a species-specific genomic DNA fragment from 
Staphylococcus saprophytics follows. Twenty different oligonucleotide primers of 10 
nucleotides in length (all included in the AP-PCR kit OPAD (Operon Technolog.es, Inc 
30 Alameda, CA» were tested systematically with DNAs from 3 bacterial stra.ns of 
Stepnyfococcus saprophytics (all obtained from the American Type Culture Collect.on 
(ATCC)- numbers 15305. 35552 and 43867) as well as with DNA from four other 
staphylococcal species [Staphylococcus aureus ATCC 25923. Staphylococcus 
epidarmidis ATCC 14990, Sfapny/ococcus haemolyticus ATCC 29970 and 
35 Staphylococcus hominis ATCC 35982). For all bacterial species, amplification was 
performed from a bacterial suspension adjusted to a standard 0.5 McFarland wh.ch 
corresponds to approximately 1.5 x 10> bacteria/mL. One „L of the standard^ 
bacterial suspension was transferred directly to 19 „L of a PCR reaction mixture 
containing 50 mM KCi, 10 mM Tris-HCI (pH 9.0). 0.1% Triton X-100. 2.5 mM MgCI,. 
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1.2 M M of only one of the 20 different AP-PCR primers OPAD, 200 M M of each of the 
four dNTPs and 0.5 unit of Tag DNA polymerase (Promega Corp., Madison, Wl). PCR 
reactions were subjected to cycling using a MJ Research PTC-200 thermal cycler (MJ 
Research Inc.) as follows: 3 min at 96»C followed by 35 cycles of 1 min at 95°C for the 
5 denaturation step, 1 min at 32°C for the annealing step and 1 min at 72'C for the 
extension step. A final extension step of 7 min at 72"C was made after the 35 cycles 
to ensure complete extension of PCR products. Subsequently, twenty microliters of the 
PCR amplified mixture were resolved by electrophoresis in a 2% agarose gel 
containing 0.25 ^g/mL of ethidium bromide. The size of the amplification products was 
1 0 estimated by comparison with a 50-bp molecular weight ladder. 

Amplification patterns specific for Staphylococcus saprophytics were observed 
with the AP-PCR primer OPAD-9 (SEQ ID NO: 25). Amplification with this primer 
consistently showed a band corresponding to a DNA fragment of approximately 450 
bp for all Staphylococcus saprophytics strains tested but not for any of the four other 
staphylococcal species tested. This species-specific pattern was confirmed by testing 
10 more clinical isolates of S. saprophytics selected from the culture collection of the 
microbiology laboratory of the CHUL as well as strains selected from the gram-positive 
bacterial species listed in Table 5. 

The band corresponding to the approximately 450 bp amplicon which was 
specific and ubiquitous for S. saprophytics based on AP-PCR was excised from the 
agarose gel and purified using the QIAquick™ gel extraction kit (QIAGEN Inc.). The 
gel-purified DNA fragment was cloned into the T/A cloning site of the pCR 2.1™ 
plasmid vector (Invitrogen Inc.) using T4 DNA ligase (New England BioLabs). 
Recombinant plasmids were transformed into E. coli DH5a competent cells using 
25 standard procedures. Plasmid DNA isolation was done by the method of Birnboim and 
Doly (Nucleic Acids Res. 7:1513-1523) for small-scale preparations. All plasmid DNA 
preparations were digested with the EcoRI restriction endonuclease to ensure the 
presence of the approximately 450 bp AP-PCR insert into the recombinant plasmids. 
Subsequently, a large-scale and highly purified plasmid DNA preparation was 
performed from two selected clones shown to carry the AP-PCR insert by using the 
QIAGEN plasmid purification kit. These plasmid preparations were used for automated 
DNA sequencing. 

Both strands of the AP-PCR insert from the two selected clones were 
sequenced by the dideoxynucleotide chain termination sequencing method with SP6 
and T7 sequencing primers, by using an Applied Biosystems automated DNA 
sequencer as described previously. The analysis of the obtained sequences revealed 
that the DNA sequences for both strands from each clone were 100% complementary 
Furthermore, it showed that the entire sequence determined for each clone were both 
identical. These sequencing data confirm the 100% accuracy for the determined 438 
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bp sequence (SEQ ID NO: 29). Optimal amplification primers have been selected from 
the sequenced AP-PCR Sfapby/ococcus saprophytics DNA fragment with the help 
of the primer anafysis software Oligo™ 4.0. The selected primer sequences have been 
tested in PCR assays to verify their specificity and ubiquity (Table 7). These PCR 
primers were specific since there was no amplification with DNA from bacterial speaes 
other than S. saprophytics selected from Tables 4 and 5. Furthermore, this assay was 
ubiquitous since 245 of 260 strains of S. saprophytics were efficiently amplified *A 
this PCR assay. When used in combination with another S. saprophyticus-spevtc 
PCR assay, which is an object of our co-pending U.S. (N.S. 08/526.840) and PCT 
(PCT/CA/95/00528) patent applications, the ubiquity reaches 100% for these 260 
strains. 

nNAa Fo n r DNA amplification by the wide* used PCR (polymerase chain reaction) 
method, primer pairs were derived from proprietary DNA fragments or from database 
i sequences Prior to synthesis, the potential primer pairs were analyzed by us.ng the 

During DNA amplification by PCR. two ofigonucleotide pnmers b,nd,ng 
respectively to each strand of the heat-denatured target DNA from the bactena. 
enome are used to amplify exponential, in vitro the target DNA by success- 
0 thermal cycles allowing denaturation of the DNA. annealing of the pnrne s and 
synlsis of new targets at each cycle (Persing ef a/. 1993. Diagnosfic Molecular 
M^gy kelpies and Applications. American Society for Microb.ology. 
Washington^ , D.C.). ^ ^ ^ ^ ^ specjmens 

standardized bacteria, or fungal suspensions (see be.ow) were amplify h ■ 2M- 
PCR reaction mixture containing 50 mM KCI. 10 mM Tris-HC. ( P H 9.0). 2.5 mM MgCI 
0.4 ^ of each primer. 200 mM of each of the four dNTPs and 0.5 un t of Tag DNA 
polyn^P^ 

Inc Palo Alto. CA). The TaqStart™ antibody, which is a neutral.z.ng monoclonal 
antibody to Taq DNA polymerase, was added to all PCR reactions to enhance the 
specificity and the sensitivity of the amplifications (Kellogg ef af.. 1994. B,otechn,ques 
16-1134-1137) The treatment of the clinical specimens varies with the type or 
specimen tested, since the composition and the sensitivity level required are d.fferent 
for each specimen type. It consists in a rapid protocol to lyse the bactena. cells and 
eliminate the PCR inhibitory effects (see example 1 1 for urine 
For amplification from bacterial or fungal cultures, the samples were added Mfrto 
the PCR amplification mixture without any pre-treatment step (see example 10). Pnmer 
sequences derived from highly conserved regions of the bacterial 16S ribosoma RNA 
gene were used to provide an internal control for all PCR reactions. Alternatively, the 
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internal control was derived from sequences not found in microorganisms or in the 
human genome. The internal control was integrated into all amplification reactions to 
verify the efficiency of the PCR assays and to ensure that significant PCR inhibition 
was absent. The internal control derived from rRNA was also useful to monitor the 
efficiency of bacterial lysis protocols. 

PCR reactions were then subjected to thermal cycling (3 min at 95°C followed 
by 30 cycles of 1 second at 95°C for the denaturation step and 30 second at 55"C for 
the annealing-extension step) using a PTC-200 thermal cycler (MJ Research Inc ) and 
subsequently analyzed by standard ethidium bromide-stained agarose gel 
electrophoresis. The number of cycles performed for the PCR assays varies according 
to the sensitivity level required. For example, the sensitivity level required for microbial 
detecbon directly from clinical specimens is higher for blood specimens than for urine 
specmens because the concentration of microorganisms associated with a septicemia 
can be much lower than that associated with a urinary tract infection. Consequently 
more sensitive PCR assays having more thermal cycles are required for direct 
detect.cn from blood specimens. Similarly. PCR assays performed directly from 
bacterial or fungal cultures may be less sensitive than PCR assays performed directly 
from cl.n,cal specimens because the number of target organisms is normally much 
lower in clinical specimens than in microbial cultures. 

It is clear that other methods for the detection of specific amplification products 
wh.ch may be faster and more practical for routine diagnosis, may be used Such 
methods may be based on the detection of fluorescence after amplification (eg 
TaqMan™ system from Perkin Elmer or Amplisensor™ from Biotronics). Methods 
based on the detection of fluorescence are particularly promising for utilization in 
routme diagnosis as they are very rapid, quantitative and can be automated (Example 

Microbial pathogens detection and identification may also be performed by solid 
support or liquid hybridization using species-specific internal DNA probes hybridizing 
to an amplification product. Such probes may be generated from any species-specific 
or genus-specific DNA amplification products which are objects of the present 
invention. Alternatively, the internal probes for species or genus detection and 
■dentation may be derived from the amplicons produced by the universal 
amplication assay. The oligonucleotide probes may be labeled with biotin or with 
digoxigenin or with any other reporter molecules. 

To assure PCR efficiency, glycerol, dimethyl sulfoxide (DMSO) or other related 
solvents can be used to increase the sensitivity of the PCR and to overcome problems 
associated with the amplification of a target DNA having a high GC content or forming 
strong secondary structures (Dieffenbach and Dveksler, 1995, PCR Primer ■ A 
Laboratory Manual, Cold Spring Harbor Laboratory Press. Plainview, New York) The 
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concentration ranges for giycerol and DMSO are 5-15% (v/v) and 3-10% (v/v). 
respectively For the PCR reaction mixture, the concentration ranges for the 
amplification primers and MgCI, are 0.1-1.5 //M and 1.5-3.5 nil, respective*. 
Modifications of the standard PCR protocol using external and nested pnmers (..e. 
5 nested PCR) or using more than one primer pair (i.e. multiplex PCR) may also be used 
(Persing ef a/ 1993, Diagnostic Molecular Microbiology: Principles and Appl.cat.ons, 
American Society for Microbiology. Washington, D.C.). For more details about the PCR 
protocols and amplicon detection methods, see Examples 9 to 14. 

The person skilled in the art of DNA amplification knows the existence of other 
10 rapid amplification procedures such as ligase chain reaction (LCR). transcription- 
" mediated amplification (TMA). self-sustained sequence replication (3SR). nucle.c acd 
sequence-based amplification (NASBA), strand displacement amplification (SDA), 
branched DNA (bDNA) and cycling probe technology (CPT) (Lee ef a/., 1997. Nucle.c 
Acid Amplification Technologies: Application to Disease Diagnosis, Eaton Publ.sh.ng 
15 Boston. MA ; Persing ef a/.. 1993, Diagnostic Molecular Microbiology: Pnnap.es and 
" Applications. American Society for Microbiology. Washington. D.C.). The scope of th,s 
invention is not limited to the use of amplification by PCR, but rather .ncludes the use 
of any rapid nucleic acid amplification method or any other procedure wh.ch may be 
used to increase rapidity and sensitivity of the tests. Any oligonucleotide su, able for 
20 the amplification of nucleic acids by approaches other than PCR and denved from he 
" species-specific, genus-specfic and universal DNA fragments as well as rom selected 
antibiotic resistance gene sequences included in this document are also under the 
scope of this invention. 

u Y KrjHir a tinn ac« a yg with oligonucleotide probe? 
25 , n hybridization experiments, single-stranded oligonucleotides (s.ze less than 

^ 100 nucleotides) have some advantages over DNA fragment probes for the detection 
of bacteria, such as ease of synthesis in large quantities, consistency in results rom 
batch to batch and chemicai stability. Briefly, for the hybridizations, oligonucleotides 
were 5' end-labeled with the radionuclide Y -»P(dATP) using T4 polynucleotide 
30 kinase (Pharmacia) (Sambrook et a/., 1989, Molecular Cloning: A LMV*«« 
2- ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY . The 
unincorporated radionucleotide was removed by passing the labeled oligonucleotide 
through a Sephadex G-50™ column. Alternatively, oligonucleotides were labeled wrth 
biotin. e*her enzymatically at their 3 ends or incorporated directly ^"9 synthes« at 
35 their 5' ends, or with digoxigenin. It will be appreciated by the person skilled .n the art 
that labeling means other than the three above labels may be used. 

Each oligonucleotide probe was then tested for its specificity by hybr.d.zat.on 
to DNAs from a variety of bacteria, and fungal species selected from Tables 4, 5 and 
6 All of the bacteria, or fungal species tested were likely to be pathogens assocated 
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with common infections or potential contaminants which can be isolated from clinical 
specimens. Each target DNA was released from bacterial cells using standard 
chermcal treatments to lyse the ceils (Sambrook et a/., 1989, Molecular Cloning- A 
Laboratory Manual. 2-ed.. Cold Spring Harbor Laboratory Press, Cold Spring Harbor 
NY). Subsequently, the DNA was denatured by conventional methods and then 
.rrevers.bly fixed onto a solid support (e.g. nylon or nitrocellulose membranes) or free 
in solution. The fixed single-stranded target DNAs were then hybridized with the 
ohgonucleotide probe cells (Sambrook era/., 1989, Molecular Cloning: A Laboratory 
Manual, 2" ed.. Cold Spring Harbor Laboratory Press, Cold Spring Harbor NY) Pre- 
hybndization conditions were in 1 M NaCI + 10% dextran sulfate + 1% SDS + 100 
^g/mL salmon sperm DNA at 65'C for 15 min. Hybridization was performed in fresh 
pre-hybndization solution containing the labeled probe at 65"C overnight Post- 
hybridization washing conditions were as follows: twice in 3X SSC containing 1% SDS 
twice in 2X SSC containing 1% SDS and twice in 1X SSC containing 1% SDS (all of 
these washes were at 65 C for 15 min), and a final wash in 0.1X SSC containing 1% 
SDS at 25'C for 15 min. Autoradiography of washed filters allowed the detection of 
selectively hybridized probes. Hybridization of the probe to a specific target DNA 
md.cated a high degree of similarity between the nucleotide sequence of these two 
DNAs because of the high stringency of the washes. 

An oligonucleotide probe was considered specific only when it hybridized solely 
to DNA from the species or genus from which it was isolated. Oligonucleotide probes 
found to be specific were subsequently tested for their ubiquity (i.e. ubiquitous probes 
recogmzed most or all isolates of the target species or genus) by hybridization to 
m.crob,al DNAs from clinical isolates of the species or genus of interest including 
ATCC strams. The DNAs from strains of the target species or genus were denatured 
fixed onto nylon membranes and hybridized as described above. Probes were 
considered ubiquitous when they hybridized specifically with the DNA from at least 
80% of the isolates of the target species or genus. 

specificity and ubiquity tests for aUganygiaafi ri a r rimsr* an n msbQS 

The specificity of oligonucleotide primers and probes, derived either from the 
DNA fragments sequenced by us or selected from databases, was tested by 
amplification of DNA or by hybridization with bacterial or fungal species selected from 
those listed in Tables 4, 5 and 6. as described in the two previous sections 
Oligonucleotides found to be specific were subsequently tested for their ubiquity by 
ampliation (for primers) or by hybridization (for probes) with bacterial DNAs from 
|solates of the target species or genus. Results for specificity and ubiquity tests with 
th ohgonucleotide primers are summarized in Table 7. The specificity and ubiquity of 
the PCR assays using the selected amplification primer pairs were tested directly from 
cultures (see Examples 9 and 1 0) of bacterial or fungal species 
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„«h common W o, Potential contaminant which can be Mated from clintea, 
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The various species-specific and genus-specific PCR assays which are objects 
of the present invention are all specific. For the PCR assays specific to bacterial 
species or genus, this means that DNA isolated from a wide variety of bacterial 
species, other than that from the target species or genus and selected from Tables 4 
5 and 5, could not be amplified. For the PCR assay specific to Candida albicans it 
means there was no amplification with genomic DNA from the fungal species listed in 
Table 6 as well as with a variety of bacterial species selected from Tables 4 and 5. 

The various species-specific and genus-specific PCR assays which are objects 
of the present invention are also all ubiquitous (Table 7). (i) The species-specific PCR 
10 assays for E. faecium, L. monocytogenes, S. saprvphyticus, S. agalactiae and C 
albwans amplified genomic DNA from all or most strains of the target species tested 
which were obtained from various sources and which are representative of the diversity 
within each target species (Table 7). The species identification of all of these strains 
was based on classical biochemical methods which are routinely used in clinical 
15 microbiology laboratories, (ii) The genus-specific PCR assays specific for 
Enterococcus spp., Staphylococcus spp.. Streptococcus spp. and Neisseria spp 
amplified genomic DNA from all or most strains of the target genus tested, which 
represent all clinically important bacterial species for each target genus. These strains 
were obtained from various sources and are representative of the diversity within each 
target genus. Again, the species identification of all of these strains was based on 
classical biochemical methods which are routinely used in clinical microbiology 
laboratories. More specifically, the four genus-specific PCR assays amplified the 
following species: (1) The Enfe/ococcus-specific assay amplified efficiently DNA from 
all of the 11 enterococcal species tested including E avium, E. casseliflavus, E dispar, 
E. durans, E. faecalis, E. faecium, E. flavescens, E. gallinarum, E. hirae, E. mundtii 
and E. raffinosus. (2) The /Ve/sseria-specific assay amplified efficiently DNA from all 
of the 12 neisseria! species tested including N. canis, N. cinerea, N. elongata, N 
flavescens, N. gonorrhoeae, N. lactamica, N. meningitidis, N. mucosa ' N. 
polysaccharea, N. sicca, N. subflava and N. weaveri. (3) The Sfapny/ococo/s-specific 
assay amplified efficiently DNA from 13 of the 14 staphylococcal species tested 
including S. aureus, S. auricularis, S. capitis, S. cohnii, S. epidermidis, S. 
haemolyticus, S. hominis, S. lugdunensis, S. saprvphyticus, S. schleiferi, S. simulans. 
S. wameri and S. xylosus. The staphylococcal species which could not be amplified 
is S. sciuri. (4) Finally, the Sfrepfococo/s-specific assay amplified efficiently DNA from 
all of the 22 streptococcal species tested including S. agalactiae, S. anginosus, S 
bovis, S. constellatus, S. crista, S. dysgalactiae, S. equi, S. gordonii, S. intermedius, 
S. m,tis, S. mutans, S. oralis, S. parasanguis, S. pneumoniae, S. pyogenes S 
salivarius, S. sanguis, S.sabrinus, S. suis, S. uberis, S. vestibularis and S. viridans. 
On the other hand, the Streptococcus-specific assay did not amplify 3 out of 9 strains 
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of S. nutans and 1 out of 23 strains of S. salivarius, thereby showing a slight lack of 
nbiauitv for these two streptococcal species. 

AU specify and ubiquitous amplication primers fo, each targe, 
soecies or genua or antibiotic resistance gene investigated are listed in Annex VI 
££Th me sequenced DNA tragnrente can occur, insora, as tea emergence o 
SLTsequences or a par, .hereof does no, affec, tee specialty o, the probes or 
Lolifroanon primers. Variant bactenal DNA is under the scope of Ihrs uwenbon. 

P te PC^arnpligcation pnmers listed in Annex V, were al, tested for tee,, 
specify and ubiquity using reference strains as well as dinted isolates fmm venous 
, rographlcal locations. The 35, reference sdaine used ,o tes, ,he ampU*ca,,on and 
h then assays (Tables 4. 5 and 6, were ob,ained from (i) ^ ~ Type 
Culture Coilecbon <ATCC): 65%. <t ,h. Labora,oi,e de sante Quebe 
(LSPOV 10% (SO the Centers fo, Disease Oonbol and Ptevenhon (CDC). 3 ^ M >he 
Sat Cultera Type Cottection (NCTC): ,% and (v) several other rafcrance 
5 Tories .hnaughou, ,he wodd: ,%. These ,e,e,ence Crates are representative o, 
; 90 g'nlnegaL bacteria, epedes ,,60 strains: Tabte 4). (ii) 97 gram^osdw 
balrspeotes „ 54 strains; Tabte 5, and (lit) ,2 fungal speefcs ,38 sba,ns; Table 6). 

,„ feaures lennore, ovenuse o, anbbtebc. inevitabfy teads ,o the emergence o 
ideua, rasisrance Our goal is ,o provide clinicians, in approxUnately one hour, tee 

n^t dinted specimens wdh DNA-based teste ,o, unweraa, badenal detecbon and 
o, tee prince o, a speed* pateogen in tee posiuve spea^fh 
„* !L.e S and/o, genus-specifc DNA-based teats, clinicians also need tenely 

We feel tea, tee most efficient strategy to evaluate rapxlly bactenal resistanM w 
^£2* ie to defeel dirediy from tee dinted spedmens the most ^mmon and 
important andbiotic resistance gene, (i.e. DNA-based teste ,or tee detecbon 
30 72 J res„ance genes,. Stece tee sequence >™*»^^ £ 
mmmon bacterial antibiotic resistance genes are available from databases, ou 

design sped.0 d*onuc,eobde pbmers o, probes which w» be used as ^ 
developmen, o, rapid DNA-based teste. The sequence bom each of tee bactenal 

inddenc* and importance) Is g'rven in tee Sequence UsUn* Wles 9 and 
,0 summarize some charadensbes of the selected andbiobc raastem* gene^Ou 
Load, rs unique because ,h. andbteic resistance genes detecbon and the badena 
rdn and fden«ce.ten era performed simu,faneously in mutfiplex assays unde, 
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uniform PCR amplification conditions (Example 13) 

imnnn Pr ° VideS 3 ° f a " amP ' ifiCati0n PrimerS Se,ected from 26 Really 

important ant.b.ot,c resistance genes which were tested in PCR assays. The various 

PCR assays for antibiotic resistance genes detection and identification were validated 
by testmg severa. resistant bacteria, isolates known to carry the targeted gene and 
obtamed from various countries. The testing of a large number of strains which do not 
cany the targeted resistance gene was also performed to ensure that all assays were 
specific, so far, all PCR assays for antibiotic resistance genes are highly specific and 
have detected all control resistant bacterial strains known to carry the targeted gene 
The results of some clinical studies to validate the array of PCR assays for the 
detect.cn and identification of antibiotic resistance genes and correlate these DMA- 
based assays with standard antimicrobials susceptibility testing methods are presented 
in Tables 11 and 12. 
Universal bacterial rioWti» n 

In the routine microbiology laboratory, a high percentage of clinical specimens 
sent for bacterial identification are negative by culture (Table 4). Testing clinical 
samples with unh/ersal amplification primers or universal probes to detect the presence 
of bacteria prior to specific identification and screen out the numerous negative 
specmens is thus useful as it saves costs and may rapidly orient the clinical 
management of the patients. Severa. amplification primers and probes were therefore 
synthesized from highly conserved portions of bacterial sequences from the tuf genes 
(Table 8). The universal primer selection was based on a muttiple sequence alignment 
constructed with sequences determined by us or selected from available database 
sequences as described in Example 1 and Annex I. 

For the identification of database sequences suitable for the universal detection 
of bacteria, we took advantage of the fact that the compiete genome sequences for 
two distant microorganisms (i.e. Mycoplasma genitaiium and Haemophilus influenzae) 
are available. A comparison of the amino acid sequence for all proteins encoded by 
the genome of these two distant microorganisms led to the identification of highly 
homologous proteins. An analysis of these homologous proteins allowed to select 
some promising candidates for the development of universal DNA-based assays for 
the detection of bacteria. Since the complete nucleotide sequence of severa. other 
m,crob,al genomes are present* available in databases, a person skilled in the art 
could anwe to the same conclusions by comparing genomes sequences other than 
those of Mycoplasma genitaiium and Haemophilus influenzae. The selected tuf gene 
encodes a protein (EF-Tu) involved in the translation process during protein synthesis 
Subsequently, an extensive nucleotide sequence analysis was performed with the tuf 
gene sequences available in databases as well as with novel tuf sequences which w 
have determined as described previously. All computer analysis of amino acid and 



SUBSTITUTE SHEET (RULE 26) 



PCT/CA97/00829 - 

WO 98/20157 

-22- 

nucleotide sequences were performed by using the GCG programs. Subsequently, 
optimal PCR primers for the universal amplification of bacteria were selected wrth the 
help of the Oligo™ program. The selected primers are degenerated at several 
nucleotide positions and contain several inosines in order to allow the amplification of 
5 all clinically relevant bacterial species (Annex I). Inosine is a nucleotide analog able to 
specifically bind to any of the four nucleotides A. C, G or T. Degenerated 
oligonucleotides consist of an oligonucleotide mix having two or more of the four 
nucleotides A C, G or T at the site of mismatches. The inclusion of inosine and/or of 
degenerescences in the amplification primers allow mismatch tolerance thereby 
10 permitting the amplification of a wider array of target nucleotide sequences 
(Dieffenbach and Dveksler, 1995 PCR Primer A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Plainview, NY). 

The amplification conditions with the universal primers were identical to those 
used for the species- and genus-specific amplification assays except that the 
15 annealing temperature was 50°C instead of 55°C. This universal PCR assay was 
specific and nearly ubiquitous for the detection of bacteria. The specificity for bactena 
was verified by amplifying genomic DNA isolated from the 12 fungal species listed in 
Table 6 as well as genomic DNA from Leishmania donovani, Sacoharomyces 
cerevisiae and human lymphocytes. None of the above eukaryotic DNA preparafions 
20 could be amplified by the universal assay, thereby suggesting that this test is specific 
for bacteria. The ubiquity of the universal assay was verified by amplifying genomic 
DNAs from 116 reference strains which represent 95 of the most clinically relevant 
bacterial species. These species have been selected from the bacterial species listed 
in Tables 4 and 5. We found that 104 of these 116 strains could be amplified. The 
25 bacterial species which could not be amplified belong to the following genera: 
Corynebactertum (11 species) and Sfenofrophomonas (1 species). Sequencing of the 
tuf genes from these bacterial species has been recently performed. This sequencing 
data has been used to select new universal primers which may be more ub.qu.tous 
These primers are in the process of being tested. We also observed that for several 
30 species the annealing temperature had to be reduced to 45'C in order to get an 
efficient amplification. These bacterial species include Gemella morbilbwm, Us ena 
spp (3 species) and Gartnerella vaginalis. It is important to note that the 95 bacterial 
species selected from Tables 4 and 5 to test the ubiquity of the universal assay include 
all of the most clinically relevant bacterial species associated with a variety of human 
35 infections acquired in the community or in hospitals (nosocomial infections). The most 
clinically important bacterial and fungal pathogens are listed in Tables 1 and 2. 
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EXAMPLES A ND ANNFyF S 

The following examples and annexes are intended to be illustrative of the 
various methods and compounds of the invention, rather than limiting the scope 
thereof. 

The various annexes show the strategies used for the selection of amplification 
pnmers from M sequences or from the recA gene: (i) Annex I illustrates the strategy 
used for the selection of the universal amplification primers from hvf sequences (ii) 
Annex II shows the strategy used for the selection of the amplification primers specific 
for the genus Enterococcus from tuf sequences, (iii) Annex III illustrates the strategy 
used for the selection of the amplification primers specific for the genus 
Staphylococcus from ft/f sequences, (iv) Annex IV shows the strategy used for the 
selection of the amplification primers specific for the species Candida albicans from tuf 
sequences, (v) Annex V illustrates the strategy used for the selection of the 
amplication primers specific for the genus Streptococcus from mcA sequences (vi) 
Annex VI gives a list of all selected primer pairs. As shown in these annexes, the 
selected amplification primers may contain inosines and/or degenerescences. Inosine 
is a nucleotide analog able to specifically bind to any of the four nucleotides A, C, G 
or T. Alternatively, degenerated oligonucleotides which consist of an oligonucleotide 
mix having two or more of the four nucleotides A. C, G or T at the site of mismatches 
were used. The inclusion of inosine and/or of degenerescences in the amplification 
primers allow mismatch tolerance thereby permitting the amplification of a wider array 
of target nucleotide sequences (Dieffenbach and Dveksler. 1995 PCR Primer A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Plainview, New York) 
EXAMPLES 
EXAMPLE 1 : 

Selection of universal PCR nrimers from t,,f*^ ,»^ As snown in , 
the comparison of tuf sequences from a variety of bacterial and eukaryotic species 
allowed the selection of PCR primers which are universal for the detection of bacteria 
The strategy used to design the PCR primers was based on the analysis of a multiple 
sequence alignment of various fc/f sequences. This multiple sequence alignment 
includes tuf sequences from 38 bacterial species and 3 eukaryotic species either 
determined by us or selected from databases (Table 13). A careful analysis of this 
multiple sequence alignment allowed the selection of primer sequences which are 
conserved within eubacteria but which discriminate sequences from eukaryotes, 
thereby permitting the universal detection of bacteria. As shown in Annex I, the 
selected primers contain several inosines and degenerescences. This was necessary 
because there is a relatively high polymorphism among bacterial /^sequences despite 
the fact that this gene is highly conseived. In fact, among the tuf sequences that we 
determined, we found many nucleotide variations as well as some deletions and/or 
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inseriions of amino acids. The selected universal primers were specific and ubiquitous 
for bacteria (Table 7). Of the 95 most clinically important bacterial species tested. 12 
were not amplified. These species belong to the genera Corynebacterium (1 1 species) 
and Stenotrophomonas (1 species). The universal primers did not amplify DNA of non- 
5 bacterial origin, including human and other types of eukaryotic DNA. 
EXAMPLE 2 : 

gplPrtinn of q pmis-s pecific PCR nrimers from tuf seq u e nces. As shown in 
Annexes 2 and 3, the comparison of tuf sequences from a variety of bacterial species 
allowed the selection of PCR primers specific for Enterococcus spp. or for 
10 Staphylococcus spp. The strategy used to design the PCR primers was based on the 
analysis of a multiple sequence alignment of various tuf sequences. These multiple 
sequence alignments include the tuf sequences of four representative bacterial species 
selected from each target genus as well as tuf sequences from species of other closely 
related bacterial genera. A careful analysis of those alignments allowed the selection 
1 5 of oligonucleotide sequences which are conserved within the target genus but which 
discriminate sequences from other closely related genera, thereby permitting the 
genus-specific and ubiquitous detection and identification of the target bacterial genus. 

For the selection of primers specific for Enterococcus spp. (Annex II), we have 
sequenced a portion of approximately 890 bp of the tuf genes for Enterococcus avium, 
20 E feecafe, E. faecium and E. gallinarum. All other tuf sequences used in the alignment 
were either sequenced by us or selected from databases. The analysis of this 
sequence alignment led to the selection of a primer pair specific and ubiquitous for 
Enterococcus spp. (Table 7). All of the 1 1 enterococcal species tested were efficiently 
amplified and there was no amplification with genomic DNA from bacterial species of 

25 other genera. 

For the selection of primers specific for Staphylococcus spp. (Annex III), we 
have also sequenced a portion of approximately 890 bp of the tuf genes for 
Staphylococcus aureus, S. epidermidis, S. saprophytics and S. simulans. All other 
tuf sequences used in the alignment were either sequenced by us or selected from 

30 databases. The analysis of this sequence alignment led to the selection of two pnmer 
pairs specific and ubiquitous for Sfapny/ococcus spp. (Table 7). Annex III shows the 
strategy used to select one of these two PCR primer pairs. The same strategy was 
used to select the other primer pair. Of the 14 staphylococcal species tested, one (S. 
sciuri) could not be amplified by the Stephy/ococcus-specific PCR assays using either 

35 one of these two primer pairs. For PCR assays using either one of these two primer 
pairs, there was no amplification with DNA from species of other bacterial genera. 
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EXAMPLE 3 : 

Section from faf sequences of pc.r P n n ar SD6dfin fnr ran ^ a ^_ 
shown in Annex IV, the comparison of ^sequences from a variety of bacterial and 
eukaryotic species allowed the selection of PCR primers specific for Candida albicans 
The strategy used to design the PCR primers was based on the analysis of a multiple 
sequence alignment of various fuf sequences. This multiple sequence alignment 
mcludes tuf sequences of five representative fungal species selected from the genus 
Candtda which were determined by our group (i.e. C. albicans, C. glabrata, C. krusei 
C. parapsilosis and C. tropicalis) as well as fc/f sequences from other closely related 
fungal species, fu/ sequences from various bacterial species were also included. A 
careful analysis of this sequence alignment allowed the selection of primers from the 
C albicans tuf sequence; these primers discriminate sequences from other closely 
related Candida species and other fungal species, thereby permitting the species- 
specific and ubiquitous detection and identification of C. albicans (Table 7). All of 88 
Candida albicans strains tested were efficiently amplified and there was no 
amplification with genomic DNA from other fungal or bacterial species 
EXAMPLE 4: 

Selection of PCR primers specific for Sitr^ tn coccuft fm m m # As shown in 
Annex V, the comparison of the various bacterial recA gene sequences available from 
databases (GenBank and EMBL) was used as a basis for the selection of PCR primers 
which are specific and ubiquitous for the bacterial genus Streptococcus. Since 
sequences of the recA gene are available for many bacterial species including five 
species of streptococci, it was possible to choose sequences well conserved within the 
genus Streptococcus but distinct from the recA sequences for other bacterial genera. 
When there were mismatches between the recA gene sequences from the five 
Streptococcus species, an inosine residue was incorporated into the primer (Annex V). 
The selected primers, each containing one inosine and no degenerescence, were 
specific and ubiquitous for Streptococcus species (Table 7). This PCR assay amplified 
all of the 22 streptococcal species tested. However, the Sfrepfococcus-specific assay 
did not amplify DNA from 3 out of 9 strains of S. mutans and 1 out of 3 strains of S. 
saiivarius. There was no amplification with genomic DNA from other bacterial genera 
(Table 7). 
EXAMPLE 5: 

Nucleotide sequencing of DNA fragment s. The nucleotide sequence of a portion 
of the ftrfgenes from a variety of bacterial or fungal species was determined by using 
the dideoxynucleotide chain termination sequencing method (Sanger et al 1977 
Proc. Natl. Acad. Sci. USA. 74:5463-5467). The sequencing was performed by using 
an Applied Biosystems automated DNA sequencer (model 373A) with their PRISM™ 
Sequenase* Terminator Double-stranded DNA Sequencing Kit (Perkin-Elmer Corp., 
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A pp«ed Blosystems Division, Foster City. CA). The sequencing strategy does not 
^ nlate lufA and MB genes because the sequencing primere hybndze etentty 
TT^l mJZ These DNA sequences are shown in the sequence M, 
° S Q , tr 8 o 146). The presence o, severe, degenerated nucieoddes in the 

^TpteTwere selected from the given proprietary DNA fragments or 
sequTnTs using the OI*o™ program and were synthesfced with an 
autonrated ABI DNA synthesizer (Model 391 , Petkin-Elmer Corp-, Applied Blosystems 
Division) using phosphoramidite chemtstty. 

WMm £2 ali *aU*** * hVhridirnlion — » Each oligonrideodde was 
, end-laS^n^ATP, hy the T4 polynucieo^e kinase (Phartnac*, as 
- Ho^rrihPd earlier The label could also be non-radioactive. 

were tel^heir specificity by hybridteationto DNAs from a vanety of bactena. and 
were tested tor i J~ 5 and 6 as described earlier. Spec.es-spec.fic 

TZZT ^eTwT^ hvhdd^tg ont, to DNA trom the mfcrobial 
, JZ TorC L wh,h it was .elated. O,*onuc,eo«e prohes found ,o he 

lab eied oiigonudeohde prohea as descdbed for ^^ a" ^s 

30 pies were LeV^ .o a, teas, 80% o, DNAs horn the bade* - CnKal 
isolates of the target species or genus. 

^me as 6 except dra, a p~. o. spec* -J— £ 

used ,o, microbial «enWcauon ,i) » h«ease senary and 

specimen. 
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EXAMPLE8: 
EXAMPLE 9: 

PCR amplification . The technique of PCR was used to increase th« * 
and the rapidity of the assau* Th« . , , • 'ncrease the sensitivity 

(see Ex am p, e ,0, ,o detom*. (heir specWy 

sp.Cc and ubiquitous PCR prfmer palre „ „ sted ,„ ^ * " 

selected^ Sf " N ' % ^^a^tes- The specMy of a» 
selected PCR pnmer patra was tested against DNAs from a variety of bacterial and 

apecflc for each speces or genus were then tested for their ublquKy to ensure that 

20 . a " d <**- isolates from around the world which ana 

20 representative of the diversity within each species or genus " «'<="'«» 

Standard precautions to avoid faise posit™ PCR results should be taken (Kwok 
and Htguchi. , 88 g, Nature, 23*237-238). Methods to inacfrvate toJJSS 

25 EXAMPLE 10t 

n^ JT^T ""° h " r "' ri "' ^ ^ate- PCR assays were 

perfonoed ehher diteofry front a bacteria, colony ot from a bactetia, suspension The 
Me bemg actuated to a standard McFartand 0.5 (corresponds to approximate* 5 
x 10- bactenafrnL,. , n case of direc , amp|fflcallon « 

30 colony was fransfetred using a plastic rod ditecdy Intoa 20„L PCR LptaTbZ 
fining 50 rnM KC. ,0 mM T„a-HC, (pH g o,. 0,% Tdton X-,^5 1 ^ 
0.4 „M of each pnmer. 200 „M of each of me bur dNTPs and 0.5 unit of raoDNA 
^=<^)comb ta ed»1 m ^T^» a ^ (c , 0 ^^^ 

standard McFarland 1.0 (corresponds to approximately 3.0 » 10- baoteLaL) 
concentrated 100 Umes by cenfr^ion was added d ire c% to lhB PCR re ~ b 
-~s te p fc c y eaa,^w,ape^ b ec 3UMaMcFa ^„ R d 7^ 
cans has approxrmately 200 times fewer celts man a McParland 0.5 tor bacteria, ce»s 
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PGR readies were then selected to therm* cycling (3 min at 95-C followed 
hv 30 cvcles of 1 second at 95-C for the denaturalion step and 30 seconds at 55 C 

£ he Sli— - *■* «*• a ptc - 2o ° ,hemai ^ rcR a r: 

-era then analyced b, standard agarose get (2%) 
Z^tton products were visualfced in agarose gets conlainrng 0.25 «,ml of 
' 2™de under UV a, 254 nn. The en*, PCR assay can be completed . 

^ (rom „ i9hly conserved ,egions ofthe bacterial 16S 
■r, I^NA oene were used to provide an interna! control for all PCR reactrons. 
, control was derived Iron, 00,00000= no, found In 

£££L or in the human genome. The internal control was Unrated ,n,o a 

■ nL,„. pcR Inhibition was absent. The internal control derrved from rRNA was also 

,5 iZLs-spe* „, g.nus-specif« amplifictions were performed srmultaneous,, 
in multiplex PCR assays. 

"""^j ^ "» PCR amp.Hica.ion performed 

20 containing suu mwi inw., treated urine specimen was 

fnr at least 1 5 minutes at room temperature, 1 M L of the treatea urine p 

TaaStart™ antibody (Clontech Laboratories Inc.). . otert ,nn 
L^s for .be inlema. cordrot, PCR ampliation and agarose gel detect™ 
of the amplicona are as previously described in example 10. 

cMcd specsnens. M of ba ceria as well 
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EXAMPLE 13: 

■den* bala, a J C p aCTt7"f ,'° T""* ^ ^ 

25 and may be automated On. of ! 0dS 3,6 " uant *aeve 

EXAMPLE 15: 

r --i^rerrt^r^a? 

clinical specimen. The scope of this invention is therefore not limited toThe^iseof the 
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DNA sequences from the enclosed Sequence Listing for PCR only but rather includes 
the use of any procedures to specifically detect bacterial DNA and which may be used 
to increase rapidity and sensitivity of the tests. 

EXAMPLE 16: . . 

5 A test kit would contain sets of probes specific for each m.crob.al spec.es or 

aenus as well as a set of universal probes. The kit is provided in the form of test 
components, consisting of the set of universal probes labeled with non-radioactive 
,abe.s as well as labeled species- or genus-specific probes for the detection of each 
pathogen of interest in specific types of clinical samples. The kit will also mclude test 

10 reagents necessary to perform the pre-hybrid^ation. hybridization, washing steps and 
' hybrid detection. Finally, test components for the detection of known ant.bfot.c 
resistance genes (or derivatives therefrom) will be included. Of course the krt w, 
include standard samples to be used as negative and positive controls for each 
hybnd.zat.or . test ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 

' " and to detect pathogens common* encountered in that type of specim en ^Reagente 
or the universal detection of bacteria will also be included. Based on the srtesof 
Lion, the following kits for the specific detection of P^ens may be devetop^. 
- Akrtforthe universal detection of bacteria, orfunga. pathogens from all elm cal 

20 specimens which contains sets of probes specific for highly conserved regions of the 

detedion Qf microbja , pathogens retneved fr0 m urine samples 
which contains 5 specific test components (sets of probes for the detection of 
Enterococcus faecium, Enteroccus spedes, Srapny/ococcus saprophytes, 
o<\ StaDhvlococcus species and Candida albicans). 

25 Sfap.y/ococc^ £ ^ ^ 3 speclfic fest 

components (sets of probes for the detection of Staphylococcus speaes. 
Enterococcus species and Candida albicans). 

- A kit for the detection of pathogens retrieved from blood samples. wh.ch 
30 contains 10 specific test components (sets of probes for the detection of 
ZZoccus species. Streptococcus aga/acf/ae. Stapnyiococcus speaes, 
Staphylococcus saprophytic Enterococcus species, ^terococcus 
NeLena species. Neisseria meningitidis, Listena monocytogenes an* Canada 
Zans). This kit can ateo be applied for direct detection and identified from blood 
35 cultures. ^ ^ ^ ^ ^ whjch contains 5 

spedfictestcomponents^ts of probes for thedetection of f^T^ 
listeria monocytogenes, Neissena meningitidis, Neissena speaes and Sfap/iy/occccus 

species). 
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- A kit for the detection of clinically important antibiotic resistance genes which 
contains sets of probes for the specific detection of at least one of the 26 following 
genes associated with antibiotic resistance: bla^ bla^ bta^ bla Ma , blaZ, aadB 
aacC1, aacC2, aacC3, aacA4, aacB'-lla, erwA, ermB, erwQ mecA, vanA, vanB vanC 
satA, aac(6')-aph(2"), aad(6'), vat, vga, msrA, sul and int. 

- Other kits adapted for the detection of pathogens from skin, abdominal wound 
or any other clinically relevant infections may also be developed 

EXAMPLE 17: 

Same as example 16 except that the test kits contain all reagents and controls 
to perform DNA amplification assays. Diagnostic kits will be adapted for amplification 
by PCR (or other amplification methods) performed directly either from clinical 
specmens or from microbial cultures. Components required for (i) universal bacterial 
detect.cn. (,.) species-specific and genus-specific bacterial and/or fungal detection and 
identification and (iii) detection of antibiotic resistance genes will be included 

Amplification assays could be performed either in tubes or in microtitration 
plates having multiple wells. For assays in plates, the wells will contain the specific 
amplication primers and control DNAs and the detection of amplification products will 
be automated. Reagents and amplification primers for universal bacterial detection will 
be mcluded in kits for tests performed directly from clinical specimens. Components 
requ,red for species-specific and genus-specific bacterial and/or fungal detection and 
.dentification as well as for the simultaneous antibiotic resistance genes detection will 
be included in kits for testing directly from bacterial or fungal cultures as well as in kits 
for testing directly from any type of clinical specimen. 

The kits will be adapted for use with each type of specimen as described in 
example 16 for hybridization-based diagnostic kits. 
EXAMPLE 18: 

It is understood that the use of the probes and amplification primers described 
in this invention for bacterial and/or fungal detection and identification is not limited to 
clinical microbiology applications. In fact, we feel that other sectors could also benefit 
from these new technologies. For example, these tests could be used by industries for 
quality control of food, water, air, pharmaceutical products or other products requiring 
microbiological control. These tests could also be applied to detect and identify 
bactena or fungi in biological samples from organisms other than humans (e.g other 
pnmates. birds, plants, mammals, farm animals, livestock and others). These 
diagnostic tools could also be veiy useful for research purposes including clinical trials 
and epidemiological studies. 

This invention has been described herein above, and it is readily apparent that 
modifications can be made thereto without departing from the spirit of this invention. These 
modifications are under the scope of this invention, as defined in th appended claims 
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Table 1. Distribution (%) of nosocomial pathogens for various human 
infections in USA (1990-1 992) 1 . 



30 



Pathogen 



5 Escherichia coli 

Staphylococcus aureus 
Staphylococcus epidermidis 
Enterococcus faecalis 
Enterococcus faecium 
10 Pseudomonas aeruginosa 
Klebsiella pneumoniae 
Proteus mirabilis 
Streptococcus pneumoniae 
Group B Streptococci 
15 Other Streptococci 

Haemophilus influenzae 
Neisseria meningitidis 
Listeria monocytogenes 
Other Enterococci 
20 Other Staphylococci 
Candida albicans 
Other Candida 
Enterobacter spp. 
Acinetobacter spp. 
25 C/fnobacterspp. 

Serratia marcescens 
Other Klebsiella 
Others 



UTI 2 SSI 3 BSI 4 Pneumonia CSF 5 



27 


y 


c 

%J 


4 


2 


2 


21 


17 
l f 


21 


2 


2 


o 




n 

w 


1 


16 


12 
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o 


o 


1 


1 


0 


n 
u 


n 


12 


9 


o 

o 


1 o 




7 


3 


4 


Q 


o 


5 


3 


4 
1 




o 


0 


0 


3 


1 


18 


1 


1 


2 


4 
1 


R 


3 


5 


2 


1 
I 


3 


0 


0 


0 


O 


45 


0 


0 


0 


n 


14 




0 


0 


0 


3 


1 


1 


0 


0 


0 


2 




8 


13 


20 


9 


3 


5 


5 


0 


2 




1 


3 


10 


5 


7 


4 


12 


2 


1 


1 


2 


4 


2 


2 


1 


1 


1 


0 


1 


1 


1 


3 


1 


1 


1 


1 


2 


1 


0_ 


6_ 


4 


5 


0_ 



Data recorded by the National Nosocomial Infections Surveillance (NNIS) from 80 
hospitals (Emori and Gaynes, 1993. Clin. Microbiol. Rev., 6:428-442). 

2 Urinary tract infection. 

3 Surgical site infection. 

4 Bloodstream infection. 
35 5 Cerebrospinal fluid. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/20157 



-33- 



PCT/CA97/00829- 



Tabl 2. 



Distribution (%) of bio dstream infection pathog ns In 
Quebec (1995), Canada (1992), UK (1969-1988) and USA 
(1990-1992). 





uueuec 


Canada 2 


UK 3 




USA* 








Community- 


Hospital- 


Hospital- 








acquired 


acquired 


acquired 


E. coli 


ice 
lO.D 


CO o 

53.8 


24.8 


20.3 


5.0 


o. cfJlUcnjJIQiS 


25.8 


Nl* 


0.5 


7.2 


31.0 


and other CoNS s 










S. 3 u re us 


Q ft 


Ml 

Nl 


9.7 


19.4 


16.0 


o. pneumoniae 


6.3 


Nl 


22.5 


2.2 


NR 7 


*— . /cfC7Ua//o 


3.0 


Nl 


1.0 


4.2 


NR 


E. faecium 


2 6 


Ml 
INI 


0.2 


0.5 


NR 


Enterococcus 


NR 


Nl 


NR 


NR 


9.0 


spp. 










H. influenzae 


1.5 


NR 


3.4 


0.4 


NR 


P. aeruginosa 


1.5 


8.2 


1.0 


8.2 


3.0 


K. pneumoniae 


3.0 


11.2 


3.0 


9.2 


4.0 


P. mirabilis 


NR 


3.9 


2.8 


5.3 


1.0 


S. pyogenes 


NR 


Nl 


1.9 


0.9 


NR 


Enterobacter spp. 


4.1 


5.5 


0.5 


2.3 


4.0 


Candida spp. 


8.5 


Nl 


NR 


1.0 


8.0 


Others 


18.5 


17.4 8 


28.7 


18.9 


19.0 



Data obtained for 270 isolates collected at the Centre Hospitaller de I'Universite 
Laval (CHUL) during a 5 month period (May to October 1995). 
Data from 10 hospitals throughout Canada representing 941 gram-negative 
bacterial isolates. (Chamberland etai, 1992, Clin. Infect. Dis., 15:615-628). 
Data froma20-yearstudy(1969-1988) for nearly 4000 isolates (Eykynef at., 1990, 
J. Antimicrob. Chemother., Suppl. C, 25:41-58). 

Data recorded by the National Nosocomial Infections Surveillance (NNIS) from 80 
hospitals (Emori and Gaynes, 1993, Clin. Microbiol. Rev., 6:428-442). 
Coagulase-negative staphylococci. 

Nl, not included. This survey included only gram-negative species. 

NR, incidence not reported for these species or genera. 

In this case, 17.4 stands for other gram-negative bacterial species. 
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10 



15 



Table 3. 



Distribution of positive and negative clinical specimens tested at 
the microbiology laboratory of the CHUL (February 1994 - January 
1995). 



ClintCdl specimens 


No. of samples 


% of positive 


% of negative 


and/or sites 


tested (%) 


specimens 


specimens 


Urine 


17,981 (54.5) 


19.4 


80.6 


Blood culture/marrow 


10 010 (30.4) 


6.9 


93.1 


Sputum 


1 266 (3.8) 


68.4 


31.6 


Superficial pus 


1 1 **fi (% G\ 


72.3 


27.7 


Cerebrospinal fluid 


553 (1.7) 


1.0 


99.0 


Synovial fluid 


523 (1.6) 


2.7 


97.3 


Respiratory tract 


502(1.5) 


56.6 


43.4 


Deep pus 


473 (1.4) 


56.8 


43.2 


Ears 


289 (0.9) 


47.1 


52.9 


Pleural and pericardial 


132 (0.4) 


1.0 


99.0 


fluid 






71.4 


Peritoneal fluid 


101(0.3) 


28.6 


Total: 


32,966 (100.0) 


20.0 


80.0 
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Gram-negative bacterial species (90) used to test the specificity of 
PCR primers and DNA probes (continues on next page). 



Bacterial species 



Number of 
reference 
strains 
tested" 



Bacterial species 



Acinetobacter baumannii 1 

Acinetobacter Iwoffii 3 

Actinobacillus lignieresii 1 

Alcaligenes faecalis 1 

Alcaligenes odorans 1 
Alcaligenes xylosoxydans 

subsp. denitrificans 1 

Bacteroides distasonis 1 

Bacteroides fragilis 1 

Bacteroides ovatus 1 

Bacteroides 1 
thetaiotaomicron 

Bacteroides vulgatus 1 

Bordetella bronchiseptica 1 

Bordetella parapertussis 1 

Bordetella pertussis 2 

Burkholderia cepacia 1 

Citrobacter amalonaticus 1 

Citrobacter diversus 2 

subsp. /cosen 

Citrobacter freundii 1 

Comamonas acidovorans 1 

Enterobacter aerogenes 1 

Enterobacter 1 
agglomerans 

Enterobacter cloacae 1 

Escherichia coli 9 

Escherichia fergusonii 1 



Moraxella phenylpyruvica 
Morganella morganii 
Neisseria animalis 
Neisseria canis 
Neisseria caviae 
Neisseria cinerea 
Neisseria cuniculi 
Neisseria elongata 

subsp. elongata 
Neisseria elongata 

subsp. glycoytica 
Neisseria flavescens 
Neisseria flavescens 
Branham 

Neisseria gonorrhoeae 
Neisseria factamica 
Neisseria meningitidis 
Neisseria mucosa 
Neisseria polysaccharea 
Neisseria sicca 
Neisseria subflava 

Neisseria weaveri 
Ochrobactrum antropi 
Pasteurella aerogenes 
Pasteurella multocida 

Prevotella melaninogenica 
Proteus mirabilis 
Proteus vulgaris 



Number of 
reference 
strains 



tes 



ted* 



4 
2 
1 
3 
3 

1 
1 
1 
1 

1 
3 
1 



SUBSTITUTE SHEET (RULE 26) 



WO 98/20157 



-36 



PCT/CA97/00829 ~ 



Bacterial species 



Number of 
reference 
strains 
tested 1 



Bacterial species 



Number of 
reference 
strains 
tested" 



Escherichia hermannii 
Escherichia vulneris 
Flavobacterium 
meningosepticum 
5 Flavobacterium 
indologenes 

Flavobacterium odoratum 
Fusobacterium 
necrophorum 
1 o Gardnerella vaginalis 
Haemophilus 
haemolyticus 
Haemophilus influenzae 
Haemophilus 
15 parahaemolyticus 
Haemophilus 
parainfluenzae 
Hafnia alvei 
Kingeila indologenes 
20 subsp. suttonella 

Kingeila kingae 
Klebsiella ornithinolytica 
Klebsiella oxytoca 
Klebsiella pneumoniae 
25 Moraxella atlantae 
Moraxella catarrhalis 

Moraxella lacunata 
Moraxella osloensis 



30 



1 Providencia alcalifaciens 

1 Providencia rettgeri 

1 Providencia rustigianii 

1 Providencia stuartii 

1 Pseudomonas aeruginosa 

2 Pseudomonas fluorescens 

1 Pseudomonas stutzeri 

1 Salmonella arizonae 

1 2 Salmonella choleraesuis 

1 Salmonella gallinarum 

2 Salmonella typhimurium 

1 Serratia liquefaciens 
1 Serratia marcescens 



Shewanella putida 
Shigella boydii 
Shigella dysenteriae 
Shigella flexneri 
Shigella sonnei 
Stenotrophomonas 
maltophilia 

Yersinia enterocolitica 



1 
1 



1 
1 
1 

1 

14 

2 

1 
1 

1 
1 



Most reference strains were obtained from the American Type Culture Collect^ 
(ATCO. The other ref rence strains were obtained from (i) the Labora to.re da 
Sante Publiqu du Quebec (LSPQ). 00 the Center for O^ease Co^ and 
Prevention (CDC) and (in) the National Culture Type Collection (NCTC). 
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Gram-positiv bacterial species (97) used to test the specificity of 
PCR primers and DNA probes (continu s on next page). 



Bacterial species 



Number of 
reference 
strains 
tested 3 



Bacterial species 



Abiotrophia adiacens 1 

Abiotrophia defectiva 1 

Actinomyces israelii 1 

Clostridium perfringens 1 

Corynebacterium accolens 1 

Corynebacterium 1 
aquaticum 

Corynebacterium bovis 1 

Corynebacterium cervicis 1 

Corynebacterium 6 
diphteriae 

Corynebacterium 1 
flavescens 

Corynebacterium q 
genitalium 

Corynebacterium jeikeium 1 
Corynebacterium kutcheri 1 
Corynebacterium 1 
matruchotii 

Corynebacterium 1 
minutissimum 

Corynebacterium j 
mycetoides 

Corynebacterium 1 
pseudodiphtheriticum 
Corynebacterium q 
pseudogenitalium 
Corynebacterium renale 1 
Corynebacterium striatum 1 
Coryneba cterium ulcerans 1 



Micrococcus kristinae 
Micrococcus luteus 
Micrococcus lylae 
Micrococcus roseus 
Micrococcus varians 
Peptococcus niger 

Peptostreptococcus 
anaerobius 
Peptostreptococcus 
asaccharolyticus 
Staphylococcus aureus 

Staphylococcus auricularis 

Staphylococcus capitis 

subsp. urealyticus 
Staphylococcus cohnii 
Staphylococcus epidermidis 
Staphylococcus 
haemolyticus 
Staphylococcus hominis 

Staphylococcus 
lugdunensis 
Staphylococcus 
saprophytics 
Staphylococcus schleiferi 

Staphylococcus sciuri 
Staphylococcus simulans 
Staphylococcus warneri 



Number of 
reference 
strains 
tested 8 



1 
1 
1 
1 
1 
1 

1 

1 

10 

1 

1 

1 
2 
2 



1 

3 

1 

1 
1 
1 
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Bacterial species 



Number of 
reference 
strains 
tested 3 



Bacterial species 



PCI7CA97/00829 

Number of 
reference 
strains 
tested 3 



25 



30 



Corynebacterium 
urealyticum 

Corynebacterium xerosis 
Enterococcus avium 
Enterococcus 
casseliflavus 
Enterococcus cecorum 
Enterococcus dispar 
Enterococcus durans 
Enterococcus faecalis 
Enterococcus faecium 
Enterococcus flavescens 
Enterococcus gallinarum 
Enterococcus hirae 
Enterococcus mundtii 
Enterococcus 
pseudoavium 
Enterococcus raffinosus 
Enterococcus 
saccharolyticus 
Enterococcus solitarius 
Eubacterium lentum 
Gemella haemolysans 
Gemella morbiltorum 
Lactobacillus acidophilus 
Listeria innocua 
Listeria ivanovii 
Listeria grayi 
Listeria monocytogenes 
Listeria murrayi 
Listeria seeligeri 
Listeria welshimeri 



1 Staphylococcus xylosus 

1 Streptococcus agalactiae 

1 streptococcus anginosus 

1 Streptococcus bovis 

1 Streptococcus constellatus 

1 Streptococcus crista 

1 Streptococcus dysgalactiae 

6 Streptococcus equi 

3 Streptococcus gordonii 

1 Group C Streptococci 

3 Group D Streptococci 

1 Group E Streptococci 

1 Group F Streptococci 

1 Group G Streptococci 

1 streptococcus intermedius 

1 Streptococcus mitis 

1 Streptococcus mutans 

1 Streptococcus oralis 

1 streptococcus parasanguis 

1 Streptococcus pneumoniae 

1 Streptococcus pyogenes 

1 Streptococcus salivarius 

1 Streptococcus sanguis 

1 Streptococcus sobrinus 

3 Streptococcus suis 

1 streptococcus uberis 

1 streptococcus vestibularis 

1 



1 



1 

2 



1 

1 

6 

3 

2 

2 

1 

1 

1 

1 



. Most rete-ence strains were obtained from the Ameriran Type Culture Crrfledron 
35 (ATCC). Toe other referenoe strains were obtained from ft the 

and (111) the National Culture Type Collection (NCTC). 
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Table 6. Fungal species (12) us d to test the specificity of PGR primers and 
DNA probes. 



Fungal species 



Candida albicans 

Candida giabrata 

Candida guilliermondii 

Candida kefyr 

Candida krusei 

Candida lusitaniae 

Candida parapsilosis 
Candida tropicalis 
Rhodotorula glutinis 
Rhodotorula minuta 
Rhodotorula rubra 
Saccharomyces cerevisiae 



Number of reference 
strains tested 3 



12 
1 
1 
3 
2 
1 
2 
3 
1 
1 
1 
1 



Most reference strains were obtained from (i) the American Type Culture Collection 
(ATCC) and (n) the Laboratoire de Sante Publique du Quebec (LSPQ). 
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PCR assays developed for several clinically important bacterial 



25 



30 



Organism Primer Pair* Amplicon Ubiquity b DNA amplification from 

SEQ ID NO size (bp) culture 0 specimens d 


Fnttsmmrcus fsedUff) 

1 III r?J UL»Ul>UUw fww»*"»» 


1-2 


216 


79/80 




+ 


Listeria rnonu^yi^y^ n ^° 


3-4 


130 


164/168" 


+ 


+ 


Neisseria meningitidis 


5-6 


177 


258/258 


+ 


+ 




7-8 


149 


245/260 


+ 


NT 


saprophytics 












Streptococcus 


9-10 


154 


29/29 


+ 


+ 


agalactiae 










NT 


Candida albicans 


11-12 


149 


88/88 


+ 


Enterococcus 


13-14 


112 


87/87 


T 


NT 


spp. (1 1 species)' 












Neisseria spp. 


15-16 


103 


321/321 




+ 


(12 species)' 










NT 


Staphylococcus spp. 


17-18 


192 


13/14 


+ 


(14 species) 










NT 




19-20 


221 


13/14 


+ 


Streptococcus spp. 


21-22 


153 


210/214^ 


+ 


+ 


(22 species)' 












Universal detection" 
(95 species) 1 


23-24 


309 


104/ 116 s 


+ 


+ 



All primer pairs are specific in PCR assays since no amplification was observed 
with DNA from the bacterial and fungal species other than the species of interest 
listed in Tables 4, 5 and 6. 

Ubiquity was tested by using reference strains as well as strains from throughout 
the world, which are representatite of the diversity within each target species or 
genus. 

For all primer pairs, PCR amplifications performed directly from a standardized 
microbial suspension (MacFarland) or from a colony were all specific and 
ubiquitous. 

PCR assays performed directly from blood cultures, urine specimens or 
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cerebrospinal fluid. NT, not tested. 

The four L monocytogenes strains undetected are not clinical isolates. These 
strains were isolated from food and are not associated with a human infection. 
The bacterial species tested include all those clinically relevant for each genus 
(Tables 4 and 5). All of these species were efficiently amplified by their respective 
genus-specific PGR assay, except for the Sfapny/ococcus-specific assay, which 
does not amplify S. sciuri. 

9 The S/reptoccus-specific PGR assay did not amplify 3 out of 9 strains of S 
mutans and 1 out of 3 strains of S. salivarius. 

The primers selected for universal bacterial detection do not amplify DNA of non- 
bactenal origin, including human and other types of eukaryotic genomic DNA. 
For the universal amplification, the 95 bacterial species tested represent the most 
clinically important bacterial species listed in Tables 4 and 5. The 12 strains not 
amplified are representatives of genera Corynebacterium (11 species) and 
Stenotrophomonas (1 species). 



!. Target genes for the various genus-specific, species-specific and 
universal amplification assays. 



Microorganisms 


Gene 


Protein encoded 


Candida albicans 


tuf 


translation elongation factor EF-Tu 


Enterococcus faecium 


ddl 


D-aIanine:D-alanine ligase 


Listeria monocytogenes 


actA 


actin-assembly inducing protein 


Neisseria meningitidis 


omp 


outer membrane protein 


Streptococcus agaiactiae 


cAMP 


cAMP factor 


Staphylococcus 


unknown 


unknown 


saprophyticus 






Enterococcus spp. 


tuf 


translation elongation factor EF-Tu 


Neisseria spp. 


asd 


ASA-dehydrogenase 


Staphylococcus spp. 


tuf 


translation elongation factor EF-Tu 


Streptococcus spp. 


recA 


RecA protein 


Universal detection 


tuf 


translation elongation factor EF-Tu 
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Genes 



bla oxa 

blaZ 
aac6'-lla 
ermA 
ermB 
ermC 
vanB 
vanC 
aad(&) 



-42- 



Antibiotic resistance genes selected for diagnostic purpos s. 



SEQ ID NOs 



Antibiotics 



Bacteria 3 



selected 
primers 

49-50 



originating 
fragment 



51-52 
61-64 
91-92 
93-94 
95-96 
71-74 
75-76 
173-174 



110 p-lactams 

111 (J-lactams 

112 Aminoglycosides 

113 Macrolides 

114 Macrolides 

115 Macrolides 

116 Vancomycin 

117 Vancomycin 
Streptomycin 



Enterobacteriaceae, 
Pseudomonadaceae 

Enterococcus spp. 
Pseudomonadaceae 
Staphylococcus spp. 
Staphylococcus spp. 
Staphylococcus spp. 

Enterococcus spp. 

Enterococcus spp. 

Enterococcus spp. 



Bacteria having high incidence for the specified antibiotic resistance genes The 
presence of these antibiotic resistance genes in other bacteria is not excluded. 
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Antibiotic resistance genes from our co-pending US INS 
08K2S840, and PCX (PCT^OO^, patent appLls for 
wh.chw have selected PCR primer pairs. 



10 



15 



20 



25 



bla, 



tern 



blBrob 
blashv 



aadB 
aacC1 
aacC2 
aacC3 
aacAA 
mecA 
vanA 
satA 
aac(&)-aph(2 n ) 

vat 
vga 
msrA 
int 

sul 



SEQ ID NOs 
of selected primers 



Bacteria 3 



37-40 



45-48 
41-44 



53-54 

55-56 

57-58 

59-60 

65-66 

97-98 

67-70 

81-82 

83-86 

87-88 
89-90 
77-80 
99-102 

103-106 



p-lactams 



P-lactams 
P-lactams 

Aminoglycosides 



P-lactams 
Vancomycin 
Macrolides 
Aminoglycosides 

Macrolides 
Macrolides 
Erythromycin 
P-lactams, 
trimethoprim, 
aminoglycosides, 

antiseptic, 
chloramphenicol 



Enterobacteriaceae, 
Pseudomonadaceae, 
Haemophilus spp., 

Neisseria spp. 
Haemophilus spp., 
Pasteurella spp. 
Klebsiella spp. 
and other 
Enterobacteriaceae 
Enterobacteriaceae, 
Pseudomonadaceae 



Staphylococcus spp. 

Enterococcus spp. 

Enterococcus spp. 

Enterococcus spp., 
Staphylococcus spp. 
Staphylococcus spp. 
Staphylococcus spp. 
Staphylococcus spp. 
Enterobacteriaceae, 

Pseudomonadaceae 



30 



Bacteria having high incidence for the specified antibiotic resistance genes The 
presence of these antibiotic resistance genes in other bacteria is not excluded. 
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Tabl 11. Correlation between disk diffusion and PCR amplification of 
antibiotic resistanc genes in Staphylococcus species*. 



Disk diffusion (Kirby-Bauer) b 



Antibiotic Phenotype PCR Resistant Intermediate Sensitive 



15 



reniciiHii 


blaZ 


+ 


165 


0 


0 








n 
u 


n 


O 1 


Oxacillin 


mecA 


+ 


51 


11 


4 








2 


0 


128 


Gentamycin 


aac(6')aph(2") 


+ 


24 


18 


6 








0 


0 


148 


Erythromycin 


ermA 


+ 


15 


0 


0 




ermB 


+ 


0 


0 


0 




ermC 




43 


0 


0 




msrA 


+ 


4 


0 


0 








0 


1 


136 



a The Staphylococcus strains studied include S. aureus (82 strains), S. epidermidis 
(83 strains), S. hominis (2 strains), S. capitis (3 strains), S. haemolyticus (9 
strains), S. simulans (12 strains) and S. wameri (5 strains), for a total of 196 
strains. 

b Susceptibility testing was performed by the method of Kirby-Bauer according to the 
protocol reccommended by the National Committee of Clinical Laboratory 
Standards (NCCLS). 
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Table 12. Correlation between disk diffusion profiles and PCR amplification 
of antibiotic resistance g nes in Enterococcus sped s a . 









Disk diffusion (Kirby-Bauer) b 


Antibiotic 


Phenotype 


PCR 


Resistant 


Sensitive 




blaZ 


+ 


0 


2 


Ampicillin 














- 


i 
i 


30 


Gentamycin 


aac(6')aph(2") 


+ 


51 


1 








3 


38 


Streptomycin 


aad(&) 


+ 


26 


15 








6 


27 


Vancomycin 


vanA 


+ 


36 


0 




vanB 


+ 


26 


0 








0 


40 



The Enterococcus strains studied include £ faecalis (33 strains) and £ faecium 
(69 strains), for a total of 102 strains. 

Susceptibility testing was performed by the method of Kirby-Bauer according to the 
protocol reccommended by the National Committee of Clinical Laboratory 
Standards (NCCLS). 
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Table 13. Origin of tuf sequenc s in th S quence Listing (continues on next 
page). 



10 



15 



20 



25 



30 



SEQ ID NO 


Bacterial or fungal species 


Source 


118 


Abiotrophia adiacens 


This patent 


119 


Abiotrophia defectiva 


This patent 


120 


Candida albicans 


This patent 


121 


Candida glabrata 


This patent 


122 


Candida krusei 


This patent 


123 


Candida parapsilosis 


This patent 


124 


Candida tropicalis 


This patent 


125 


Corynebacterium accolens 


This patent 


126 


Corynebacterium diphteriae 


This patent 


127 


Corynebacterium genitalium 


This patent 


128 


Corynebacterium jeikeium 


This patent 


129 


Corynebacterium 


This patent 




pseudotuberculosis 




130 


Corynebacterium striatum 


This patent 


131 


Enterococcus avium 


This patent 


132 


Enterococcus faecalis 


This patent 


133 


Enterococcus faecium 


This patent 


134 


Enterococcus gallinarum 


This patent 


135 


Gardnerella vaginalis 


This patent 


136 


Listeria innocua 


This patent 


137 


Listeria ivanovii 


This patent 


138 


Listeria monocytogenes 


This patent 


139 


Listeria seeligeri 


This patent 


140 


Staphylococcus aureus 


This patent 


141 


Staphylococcus epidermidis 


This patent 


142 
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10 



15 



20 



25 



SEQ ID NO Nucleotide sequence 



Origin ating DNA fragment 
SEQ ID Nucleotide 



NO 



position 



Z Enteroco^uTTa^T 



30 



l 

2" 



5. -TGC TTT AGC AAC AGO CTA TCA G 
5 ' -TAA ACT TCT TCC GGC ACT TCG 



ea£t£liai _ £E££i££: Listeria .onocyto^enes 



3 

4 b 



5 « -TGC GGC TAT AAA TGA AGA GGC 
5 i -ATC CGA TGA TGC TAT GGC TTT 



26* 
26 a 



27 a 
27 a 



28* 
28 a 



ga£i££iai _ SE££i££: Ke isseria meningitidis 

5 5- -CCA GCG GTA TTG TTT GGT GGT 

S b 5 1 -CAG GCG GCC TTT AAT AAT TTC 

• i onB , ies . staphylococcus saprophytics 
Fa ctftriai i, species. 

5 1 - AGA TCG AAT TCC ACA TGA AGG TTA TTA TGA 29< 
8 „ 5 ■ - TCG CTT CTC CCT CAA CAA TCA AAC TAT CCT 29«= 

Ea£tsEial _ SE££iss: streptococcus agalactiae 



10" 



5 .. TTT CAC CAG CTG TAT TAG AAG TA 
5.-GTT CCC TGA ACA TTA TCT TTG AT 



30 a 
30' 



Bingal _ 5Ee£i£S: Candida albicans 



273-294 
468-468 



339-359 
448-46B 



56-76 
212-232 



290-319 
409-438 



59-81 
190-212 




35 



Sequences from databases. 
> These sequences are from the opposite DNA strand of the sequence of the 

originating fragment given in the Sequence Listxng. 
« Sequences determined by our group. 
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13l-i34*' b 
131-134*-* 



Bacterial gaimaj. Neisseria 



15 5'-CTG GCG CGG TAT GGT CGG TT 
16 d 5 ' -GCC GAC GTT GGA ACT GGT AAA G 
Bacterid <T*>1W Staphylococcus 



17 
18 d 
19 
20 d 



5'-CCG TGT TGA ACG TGG TCA AAT CAA A 
5'-TRT GTG GTG TRA TWG WRC CAG GAG C 
5'-ACA ACG TGG WCA AGT WTT AGC WGC T 
5'-ACC ATT TCW GTA CCT TCT GGT AAG T 



Bacterial ganug^ Streptococcus 



21 

22 d 



5 • -GAA ATT GCA GGI AAA TTG ATT GA 
5'-TTA CGC ATG GCI TGA CTC ATC AT 



31« 
31« 



140-143»' b 
14Q-143 a - b 
140-143* b 
140-143 a ' b 



32-36* 
32-36 e 



319-340 C 
410-430° 



21-40' 
102-123* 



391-4159 
584-608 9 
562-5839 
729-7539 



418-440 h 
547-569 h 



Universal primers 
23 5'-ACI KKI ACI GGI GTI GAR ARG TT 

24^ 5 1 -AYR TTI TCI CCI GGC ATI ACC AT 



118-146»." 493-515* 
147-171*- e 

118-146*^ 778-800 1 
147-171*^ 



35 



• These sequences were aligned to derive the corresponding primer. 
b tuf sequences determined by our group. 
The nucleotide positions refer to the * 

(SEQ ID NO: 132). Recalls tuf gene fragment 

d These sequences are from the opposite DNA strand of m, 
• Sequences from databases. 

' The nucleotide positions refer to the N meninaiM„- „ 

(SEQ id NO: 31) . *»>*>Bxtidia asd gene fragment 
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9 The nucleotide positions refer 

ID NO: 140) . 
* The nucleotide positions refer 

NO: 34) . 

5 * The nucleotide positions refer 
NO: 154) . 



the S. aureus tuf gene fragment (SEQ 
the S. pneumoniae recA gene (SEQ ID 
the £?. coli tuf gene fragment (SEQ ID 
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SEQ id no Nucleotide sequence 



antibiotic res< S h^ ra ^ nr . 

37 5'-CTA TGT GGC GCG GTA TTa"tC 

38 5--CGC AGT GTT ATC ACT CAT GG 



39 
40 



5' -CTG AAT GAA GCC ATA CCA AA 
5 • - ATC AGC AAT AAA CCA GCC AG 



Ant i biotic reai^^ o ^ pr . b2a ^ 



41 
42 

43 
44 



5' -TTA CCA TGA GCG ATA ACA GC 
5'-CTC ATT CAG TTC CGT TTC CC 

5 '-CAG CTG CTG CAG TGG ATG GT 
5'-CGC TCT GCT TTG TTA TTC GG 



Antibiotic rp.gi«^ n c g qfn <a> . b J 



45 
46 

47 
48 



5'-TAC GCC AAC ATC GTG GAA AG 
5 1 - TTG AAT TTG GCT TCT TCG GT 

5 » -GGG ATA CAG AAA CGG GAC AT 
5 • -TAA ATC TTT TTC AGG CAG CG 



Antibiotic Egaiajta a sg asn g; hia 0 



49 
50* 



Antibiotic Easiai^Qsg a s n a: u«z 



51 
52 b 



5 ' -ACT TCA ACA CCT GCT GCT TTC 
5 1 -TGA CCA CTT TTA TCA GCA ACC 



Antibiotic r e ffi fl f ar r o ggQg. aadB 



53 
54 



5 1 -GGC AAT AGT TGA AAT GCT CG 
5 ' -CAG CTG TTA CAA CGG ACT GG 



Originating DNA fragmen t 
SEQ id Nucleotide 
position 



NO 



5 '-GAT GGT TTG AAG GGT TTA TTA TAA G U0» 
5 ' -AAT TTA GTG TGT TTA GAA TGG TGA T HO* 



lll a 
111* 



686-710 
802-826 



511-531 
663-683 



Antibiotic rsaig&aB fis ggpg: aacCl 



55 
56 



5 1 -TCT ATG ATC TCG CAG TCT CC 
5' -ATC GTC ACC GTA ATC TGC TT 



Sequences from databases. 

- These sequences are from the opposite DNA strand of the sequence of the 
orxgxnating fragment given in the Sequence Listing. 
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, specific and ubiquitous primers for DNA amplification 



10 



15 



20 



25 



30 



35 



SEQ 



ID NO Nucleotide sequence 



BnfcibififciS resistance ggpe: aacC2 

57 5 1 - CAT TCT CGA TTG CTT TGC TA 

58 5 ' - CCG AAA TGC TTC TCA AGA TA 

59 5--CTG GAT TAT GGC TAC GGA GT 

60 S'-AGC AGT GTG ATG GTA TCC AG 

aafc^Siatic Esaiafcaass aeaa= aae*--iia 

61 5--GAC TCT TGA TGA AGT GCT GG 
62 b 5 .-CTG GTC TAT TCC TCG CAC TC 

63 5 1 - TAT GAG AAG GCA GGA TTC GT 
64 » 5 ' -GCT TTC TCT CGA AGG CTT GT 

anti fciatig r^ifit a rcQ tens : aacJU 

65 5 '-GAG TTG CTG TTC AAT GAT CC 

66 5 1 -GTG TTT GAA CCA TGT ACA CG 

anti biotic E saiatanss ssds.- aad(6') 

173 5 • -TCT TTA GCA GAA CAG GAT GAA 

174 5 ' - GAA TAA TTC ATA TCC TCC G 

^kw^h rP «n stance ggn£= vanA 

67 S'-TGTAGAGGT CTA GCC CGT GT 

68 5--ACG GGG ATA ACG ACT GTA TG 



40 



69 
70 



5 ' -ATA AAG ATG ATA GGC CGG TG 
5 • -TGC TGT CAT ATT GTC TTG CC 



nririh1 „t-i^ Esaiasaass gene; vanB 

7X 5- -ATT ATC TTC GGC GGT TGC TC 
72* 5.-GAC TAT CGG CTT CCC ATT CC 



45 



73 5 - -CGA TAG AAG CAG CAG GAC AA 
74b S '-CTH RTG GAT GCG GAA GAT AC. 



Originating DNA fragment 
SEQ ID Nucleotide 
NO position 



112" 
112" 

112" 
112" 



116" 
116* 

116" 
116" 



123-142 
284-303 

445-464 
522-541 



22-41 
171-190 

575-594 
713-732, 



» Sequences from databases. 

a « from the opposite DNA strand of the sequence of the 
» These sequences are rrom we 

originating fragment given in the Sequence Listing. 
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Annex VI: Specific and ubiquitous primers for DNA amplif ication 



SEQ id NO Nucleotide sequence 




Originating DNA fragme nt 
SEQ ID Nucleotide 
NO position 



75 
76 b 



5»-GCC TTA TGT ATG AAC AAA TGG 

5 • -GTG ACT TTW GTG ATC CCT TTT GA 



117* 
117* 



Antibiotic r^al&Un gs "rn~- marA 



11 
78 



5 • -TCC AAT CAT TGC ACA AAA TC 
5»-AAT TCC CTC TAT TTG GTG GT 



19 5 » - TCC CAA GCC AGT AAA GCT AA 
80 5' -TGG TTT TTC AAC TTC TTC CA 

Antibiotic caaiatangg gsns.: satA 



81 
82 



5»-TCA TAG AAT GGA TGG CTC AA 
5'-AGC TAC TAT TGC ACC ATC CC 



Antibiotir rj»sjj-&ancjg gens: aac(6> )-aph(2») 



83 
84 

85 
86 



5 ' -CAA TAA GGG CAT ACC AAA AAT C 
5'- CCT TAA CAT TTG TGG CAT TAT C 

5 ' -TTG GGA AGA TGA AGT TTT TAG A 
5 1 - CCT TTA CTC CAA TAA TTT GGC T 



Antibiotic r^i.^^n. g~ n — rat 



87 
88 



5 • -TTT CAT CTA TTC AGG ATG GG 
5 » -GGA GCA ACA TTC TTT GTG AC 



Antibiotic raaiafcan gfi q- n r - vg a 

89 5 r -TGT GCC TGA AGA AGG TAT TG 

90 5'-CGT GTT ACT TCA CCA CCA CT 



Antibiotic r^<n ,c ^ nce Qf » r o. 



ermA 



91 
92 b 



5 . -TAT CTT ATC GTT GAG AAG GGA TT 
5' -CTA CAC TTG GCT TAG GAT GAA A 



113 a 
113 a 



373-393 
541-563 



370-392 
487-508 



45 > Sequences from databases. 

> These sequences are from the opposite n strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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10 



15 



20 



25 



30 



35 



&n£jfciafciS resistant gene,: ermB 



93 
94 b 



5 • -CTA TCT GAT TGT TGA AGA AGG ATT 
5' -GTT TAC TCT TGG TTT AGG ATG AAA 



^aUbifiUg gaaia^anse gene: ermc 



95 
96 b 



5 ' -CTT GTT GAT CAC GAT AAT TTC C 
5 • -ATC TTT TAG CAA ACC CGT ATT C 



97 
98 



5 ' -AAC AGG TGA ATT ATT AGC ACT TGT AAG 
5. -ATT GCT GTT AAT ATT TTT TGA GTT GAA 



99 5' -GTG ATC GAA ATC CAG ATC C 

100 5 1 -ATC CTC GGT TTT CTG GAA G 

101 5 1 -CTG GTC ATA CAT GTG ATG G 

102 5 ' -GAT GTT ACC CGA GAG CTT G 

apjafcjgXiS reoi^anre gene,: buI 

103 5 ' -TTA AGC GTG CAT AAT AAG CC 

104 5'-TTG CGA TTA CTT CGC CAA CT 



PCT/CA97/00829 



amplification 

Originating DNA 
fragment 

SEQ Nucleotide 
ID NO position 



114" 
114 a 



115 s 
115° 



366-389 
484-507 



214-235 
382-403 




* Sequences from databases. 



> These seances are fro. the opposite PN* strand of the sequence of the 
originating fragment given in the Sequence Listing. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/20157 

PCT/CA97/00829 - 

- 71 - 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: INFECTIO DIAGNOSTIC (I.D.I.) INC 

(B) STREET: 2050, BOULEVARD RENE LEVESQUE OUEST, 4E ETAGE 

(C) CITY: STE-FOY 

(D) STATE: QUEBEC 

(E) COUNTRY: CANADA 

(F) POSTAL CODE (ZIP) : G1V 2K8 

(G) TELEPHONE: (418) 681-4343 

(H) TELEFAX: (418) 681-5254 

(A) NAME: BERGERON , MICHEL G. 

(B) STREET: 2069 RUE BRULARD 

(C) CITY: SILLERY 

(D) STATE: QUEBEC 

(E) COUNTRY: CANADA 

(F) POSTAL CODE (ZIP) : GIT 1G2 

(A) NAME: PICARD, FRANCOIS J. 

(B) STREET: 124 5, RUE DE LA SAPINIERE 

(C) CITY: CAP -ROUGE 

(D) STATE: QUEBEC 

(E) COUNTRY: CANADA 

(F) POSTAL CODE (ZIP) : G1Y 1A1 

(A) NAME: OUELLETTE , MARC 

(B) STREET: 1035 DE PLOERMEL 

(C) CITY: SILLERY 

(D) STATE: QUEBEC 

(E) COUNTRY: CANADA 

(F) POSTAL CODE (ZIP) : G1S 3S1 

(A) NAME: ROY, PAUL H. 

(B) STREET: 28, RUE CHARLES GARNI ER 

(C) CITY: LORETTEVILLE 

(D) STATE: QUEBEC 

(E) COUNTRY: CANADA 

(F) POSTAL CODE (ZIP) : G2A 3S1 

(ii) TITLE OF INVENTION: SPECIES -SPECIFIC, GENIUS -SPECIFIC AND 

UNIVERSAL DNA PROBES AND AMPLIFICATION PRIMERS TO RAPIDLY 
DETECT AND IDENTIFY COMMON BACTERIAL AND FUNGAL PATHOGENS 
AND ASSOCIATED ANTIBIOTIC RESISTANCE GENES . . . 

(iii) NUMBER OF SEQUENCES : 174 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
(vi) PRIOR APPLICATION DATA: 
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(A) APPLICATION NUMBER: US 08/743,637 

(B) FILING DATS: 04-NOV-1996 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGCTTTAGCA ACAGCCTATC AG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TAAACTTCTT CCGGCACTTC G 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TGCGGCTATA AATGAAGAGG C 

(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATCCGATGAT GCTATGGCTT T 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCAGCGGTAT TGTTTGGTGG T 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: , nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

CAGGCGGCCT TTAATAATTT C 

(2) INFORMATION FOR SEQ ID NO: 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



21 



21 



21 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGATCGAATT CCACATGAAG GTTATTATGA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TCGCTTCTCC CTCAACAATC AAACTATCCT 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactaae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTTCACCAGC TGTATTAGAA GTA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTTCCCTGAA CATTATCTTT GAT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 - 

CAAGAAGGTT GGTTACAACC CAAAGA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AGGTCTTACC AGTAACTTTA CCGGAT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TACTGACAAA CCATTCATGA TG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 



23 



26 



26 



22 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AACTTCGTCA CCAACGCGAA C 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGGCGCGGT ATGGTCGGTT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCCGACGTTG GAAGTGGTAA AG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17; 
CCGTGTTGAA CGTGGTCAAA TCAAA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 



21 



20 



22 



25 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TRTGTGGTGT RATWGWRCCA GGAGC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACAACGTGGW CAAGTWTTAG CWGCT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ACCATTTCWG TACCTTCTGG TAAGT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 12 

(D) OTHER INFORMATION : / note=* »n - inosine" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAAATTGCAG GNAAATTGAT TGA 



25 



23 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : mis cofeature 

(B) LOCATION: 12 . ...» 
(D) OTHER INFORMATION : /note- -n = inosine 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 
TTACGCATGG CNTGAC T CAT CAT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 3 . n<jiTlpn 
<D) OTHER INFORMATION : /note- -n = inosine 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 . ndnp „ 
Id! OTHER INFORMATION : /note - "n = inosine 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

IS S?S* Mi /no t e= m - inosine" 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

S! S I «TXO N :/note= »n = inosine" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 15 . ftdne „ 
(D) OTHER INFORMATION : /notes -n * inosine 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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ACNKKNACNG GNGTNGARAT GTT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 6 

(D) OTHER INFORMATION : / note= "„ . inosine „ 
(ix) FEATURE: 

(A) NAME/KEY; misc_f eature 

(B) LOCATION: 9 

«W OTHER INFORMATION : / note= ■„ = inosine" 
(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /not e= »„ = inosine" 
(ix) FEATURE: 

(A) NAME /KEY : misc^f eature 

(B) LOCATION: 18 

(D) OTHER INFORMATION :/note= ■„ s inosine ., 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AYRTTNTCNC CNGGCATNAC CAT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: io base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGCTTCTCC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



23 



10 



SUBSTITUTE SHEET (RULE 26) 



WO 98/20157 



- 80 - 



PCT/CA97/00829 ' 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bnterococcus faecium 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG TGCATGTGCC 
ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG GTGTGCCGCA AGTTCCTTAT 
GTACCAGTAC TTAAGAATCA ATGGAAAGAA AATCCTAAAA AAGTATTTGA TCAATGTGAA 
GGTTCTTTGC TTTATCCGAT GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT 
ACAAAGGCAG AAAACCGAGA AGAGCTGCAA AATGCTTTAG CAACAGCCTA TCAGTATGAT 
TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC TGTATTAGGA 
AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA AAGACGTAGC ATTCTATGAT 
TATGAAGCCA AATATATCAA TAATAAAATC GAAATGCAGA TTCCAGCCGA AGTGCCGGAA 
GAAGTTTATC AAAAAGCGCA AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC 
GGATTGAGCC GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GTGGGATTAA ACAGATTTAT GCGTGCGATG ATGGTGGTTT TCATTACTGC CAATTGCATT 

ACGATTAACC CCGACATAAT ATTTGCAGCG ACAGATAGCG AAGATTCTAG TCTAAACACA 

GATGAATGGG AAGAAGAAAA AACAGAAGAG CAACCAAGCG AGGTAAATAC GGGACCAAGA 

TACGAAACTG CACGTGAAGT AAGTTCACG T GATATTAAAG AACTAGAAAA ATCGAATAAA 

GTGAGAAATA CGAACAAAGC AGACCTAATA GCAATGTTGA AAGAAAAAGC AGAAAAAGGT 

CCAAATATCA ATAATAACAA CAGTGAACAA ACTGAGAATG CGGCTATAAA TGAAGAGGCT 
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TCAGGAGCCG ACCGACCAGC TATACAAGTG GAGCGTCGTC 

AGCGCAGCGG AAATTAAAAA AAGAAGGAAA GCCATAGCAT 

AGCCTTACTT ATCCGGATAA ACCAACAAAA GTAAATAAGA 

GTTGCGGATG CTTCTGAAAG TGACTTAGAT TCTAGCATGC 

CCACAACCTT TAAAAGCAAA CCAACAACCA TTTTTCCCTA 

GATGCGGGGA AATGGGTACG TGATAAAATC GACGAAAATC 

GTTGATAAAA GTGCAGGGTT AATTGACCAA TTATTAACCA 

AATGCTTCGG ACTTCCCGCC ACCACCTACG GATGAAGAGT 

ACACCAATGC TTCTTGGTTT TAATGCTCCT GCTACATCAG 

CCACCACCAC CTACGGATGA AGAGTTAAGA CTTGCTTTGC 

GGTTTTAATG CTCCTGCTAC ATCGGAACCG AGCTCGTTCG 

GAAGATGAAC TAGAAATCAT CCGGGAAACA GCATCCTCGC 

GGGGA1TTAG CTAGTTTGAG AAATGCTATT AATCGCCATA 

CCACCAATCC CAACAGAAGA AGAGTTGAAC GGGAGAGGCG 

TTTAGTTCGC TGAATAGTGG TGATTTTACA GATGACGAAA 

GAAATTGATC GCCTAGCTGA TTTAAGAGAT AGAGGAACAG 

GGTTTTTTAC CATTAAATCC GTTTGCTAGC AGCCCGGTTC 

TCGAAAATAA GCGACCGGGC TCTGATAAGT GACATAACTA 

CCATCACAGC CATTAAATGT GTTTAATAAA AAAACTACAA 

CCAACCCCTG TAAAGACCGC ACCAAAGCTA GCAGAACTTC 

ACCGTACTTA GGGAAAATAA AACACCCTTT ATAGAAAAAC 

TCAATTAATA TGCCGAGCCT ACCAGTAATC CAAAAAGAAG 

GAAATGAAAC CACAAACCGA GGAAAAAATG GTAGAGGAAA 

AACGGAAAAA ATCGTTCTGC TGGCATTGAA GAAGGAAAAC 

GACGAAAAAG CGAAGGAAGA ACCAGGGAAC CATACGACGT 

ATTGGCGTGT TCTCTTTAGG GGCGTTTATC AAAATTATTC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 415 base pairs 



ATCCAGGATT 
CATCGGATAG 
AAAAAGTGGC 
AGTCAGCAGA 
AAGTATTTAA 
CTGAAGTAAA 
AAAAGAAAAG 
TAAGACTTGC 
AACCGAGCTC 
CAGAGACGCC 
AATTTCCACC 
TAGATTCTAG 
GTCAAAATTT 
GTAGACCAAC 
ACAGCGAGAC 
GAAAACACTC 
CTTCGTTAAG 
AAAAAACGCC 
CGAAAACAGT 
CTGCCACAAA 
AAGCAGAAAC 
CTACAGAGAG 
GCGAATCAGC 
TAATTGCTAA 
TAATTCTTGC 
AATTAAGAAA 



GCCATCGGAT 
TGAGCTTGAA 
GAAAGAGTCA 
TGAGTCTTCA 
AAAAATAAAA 
GAAAGCGATT 
TGAAGAGGTA 
TTTGCCAGAG 
ATTCGAATTT 
AATGCTTCTT 
GCCTCCAACA 
TTTTACAAGA 
CTCTGATTTC 
ATCTGAAGAA 
AACAGAAGAA 
AAGAAATGCG 
TCCAAAGGTA 
ATTTAAGAAT 
GACTAAAAAA 
ACCACAAGAA 
AAACAAGCAG 
CGATAAAGAG 
TAATAACGCA 
AAGTGCAGAA 
AATGTTAGCT 
AAATAATTAA 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TACCGGTACG CTAAATATTG GTGATGTATT GGATATTATG ATTTGGGAAG CGCCGCCAGC 
GGTATTGTTT GGTGGTGGCC TTTCTTCGAT GGGCTCGGGT AGTGCGCAAC AAACCAAGTT 
GCCGGAGCAA CTGGTGACGG CACGTGGTAC GGTTTCTGTG CCGTTTGTTG GCGATATTTC 
GGTGGTCGGT AAAACGCCTG GTCAGGTTCA GGAAATTATT AAAGGCCGCC TGAAAAAAAT 
C^CCAATCAG CCGCAAGTGA TGGTGCGCTT GGTGCAGAAT AATGCGGCAA ATGTATCGGT 
GATTCGCGCA GGCAATAGTG TGCGTATGCC GTTGACGGCA GCCGGTGAGC GTGTGTTGGA 
TGCGGTGGCT GCGGTAGGTG GTTCAACGGC AAATGTGCAG GATACGAATG TGCAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TCGCTTCTCC AGAAGAAATT TTAGAAACAT ATCTAGAAAA TCCCAAATTA GATAAACCGT 
TTATATTATG TGAATACGCA CATGCAATGG GAAATTCACC AGGAGATCTT AATGCATATC 
AAACATTAAT TGAAAAATAT GATAGTTTTA TTGGCGGTTT TGTTTGGGAA TGGTGTGATC 
ATAGCATTCA GGTTGGGATA AAGGAAGGTA AACCAATTTT TAGATATGGT GGAGATTTTG 
GTGAGGCCTT ACATGACGGT AATTTTTGTG TTGATGGTAT TGTTTCGCCA GATCGAATTC 
CACATGAAGG TTATTATGAG TTTAAACATG AACATAGACC TTTGAGATTG GTTAACGAAG 
AGGATTATCG GTTTACATTG AAGAATCAAT TTGATTTTAC AAATGCGGAG GATAGTTTGA 
TTGTTGAGGG AGAAGCGA 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATGAACGTTA CACATATGAT GTATCTATCT GGAACTCTAG TGGCTGGTGC ATTGTTATTT 
TCACCAGCTG TATTAGAAGT ACATGCTGAT CAAGTGACAA CTCCACAAGT GGTAAATCAT 
GTAAATAGTA ATAATCAAGC CCAGCAAATG GCTCAAAAGC TTGATCAAGA TAGCATTCAG 
TTGAGAAATA TCAAAGATAA TGTTCAGGGA ACAGATTATG AAAAACCGGT TAATGAGGCT 
ATTACTAGCG TGGAAAAATT AAAGACTTCA TTGCGTGCCA ACCCTGAGAC AGTTTATGAT 
TTGAATTCTA TTGGTAGTCG TGTAGAAGCC TTAACAGATG TGATTGAAGC AATCACTTTT 
TCAACTCAAC ATTTAACAAA TAAGGTTAGT CAAGCAAATA TTGATATGGG ATTTGGGATA 
ACTAAGCTAG TTATTCGCAT TTTAGATCCA TTTGCTTCAG TTGATTCAAT TAAAGCTCAA 
GTTAACGATG TAAAGGCATT AGAACAAAAA GTTTTAACTT ATCCTGATTT AAAACCAACT 
CATAGAGCTA CCATCTATAC AAAATCAAAA CTTGATAAGG AAATCTGGAA TACACGCTTT 
ACTAGAGATA AAAAAGTACT TAACGTCAAA GAATTTAAAG TTTACAATAC TTTAAATAAA 
GCAATCACAC ATGCTGTTGG AGTTCAGTTG AATCCAAATG TTACGGTACA ACAAGTTGAT 
CAAGAGATTG TAACATTACA AGCAGCACTT CAAACAGCAT TAAAATAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
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ATGAAAGTAG GTTTCGTCGG CTGGCGCGGT ATGGTCGGTT CGGTTTTGAT GCAGCGTATG 60 
AAAGAAGAAA ACGACTTCGC CCACATTCCC GAAGCGTTTT TCTTTACCAC TTCCAACGTC 120 
GGCGGCGCAC GCCCTGATTT CGGTCAGGCG GCTAAAACAT TATTGGACGC GAACAACGTT 
GCCGAGCTGG CAAAAATGGA CATCATCGTT ACCTGCCAAG GCGGCGACTA CACCAAATCC 
GTCTTCCAAG CCCTGCGCGA CAGCGGCTGG AACGGCTACT GGATTGACGC GGCATCCTCG 
CTGCGTATGA AAGACGACGC GATTATCGTC CTCGACCCCG TCAACCGCAA CGTCATCGAC 
AACGGCCTCA AAAACGGCGT GAAAAACTAC ATCGGCGGCA ACTGTACCGT TTCCCTGATG 



C 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: streptococcus gordonii 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TTCATAGACG CTGAGCACGC TTTGGATCCA TCTTACGCGG CTGCTCTAGG TGTAAATATT 
GATGAGCTGT TGCTATCTCA ACCAGATTCT GGTGAGCAAG GTTTAGAAAT TGCAGGAAAA 
TTGATTGACT CTGGGGCAGT TGATTTAGTT GTCATCGACT CTGTTGCAGC TCTTGTACCA 
CGTGCGGAAA TCGATGGAGA TATCGGTGAT AGC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus /nutans 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGGCCGGAAT CTTCTGGTAA GACAACTGTC GCTCTTCATG CTGCTGCTCA GGCGCAAAAA 
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GATGGCGGTA TTGCCGCTTT CATTGATGCA GAACATGCCC TTGATCCAGC CTATGCTGCT 
GCTCTTGGCG TTAATATTGA TGAGCTTTTG CTTTCACAAC CAGATTCAGG AGAACAGGGT 
CTTGAAATTG CAGGGAAATT GATTGATTCT GGCGCTGTTG ATTTAGTTGT TGTTGACTCA 
GTGGCAGCTT TAGTACCACG TGCGGAGATT GACGGAGATA TTGGTAATAG TCATGTTGGC 
TTACAAGCAC GCATGATGAG TCAAGCGATG CGTAAATTAT CAGCTTCAAT CAATAAAACA 
AAAACCATTG CTATTTTTAT TAATCAATTG CGGGAAAAAG TTGGTATTAT GTTTGGTAAT 
CCAGAAACAA CCCCTGGCGG GCGTGCCTTG AAGTTTTATT CTTCTGTGCG TCTTGATGTC 
CGCGGCAATA CTCAAATTAA AGGAACCGGG GAACAAAAAG ACAGCAATAT TGGTAAAGAG 
ACCAAAATTA AAGTTGTTAA AAATAAAGTT GCTCCACCAT TTAAGGAAGC TTTTGTAGAA 
ATTATATATG GTGAAGGCAT TTCTCGTACA GGTGAATTAG TTAAGATTGC CAGTGATTTG 
GGAATTATCC AAAAAGCTGG AGCTTGGTAC TC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34; 
ATGGCGAAAA AACCAAAAAA ATTAGAAGAA ATTTCAAAAA AATTTGGGGC AGAACGTGAA 
AAGGCCTTGA ATGACGCTCT TAAATTGATT GAGAAAGACT TTGGTAAAGG ATCAATCATG 
CGTTTGGGTG AACGTGCGGA GCAAAAGGTG CAAGTGATGA GCTCAGGTTC TTTAGCTCTT 
GACATT^CCC TTGGCTCAGG TGGTTATCCT AAGGGACGTA TCATCGAAAT CTATGGCCCA 
GAGTCATCTG GTAAGACAAC GGTTGCCCTT CATGCAGTTG CACAAGCGCA AAAAGAAGGT 
GGGATTGCTG CCTTTATCGA TGCGGAACAT GCCCTTGATC CAGCTTATGC TGCGGCCCTT 
GGTGTCAATA TTGACGAATT GCTCTTGTCT CAACCAGACT CAGGAGAGCA AGGTCTTCAG 
ATTGCGGGAA AATTGATTGA CTCAGGTGCA GTTGATCTTG TCGTAGTCGA CTCAGTTGCT 
GCCCT^TTC CTCGTGCGGA AATTGATGGA GATATCGGAG ATAGCCATGT TGGTTTGCAG 
GCTCGTATGA TGAGCCAGGC CATGCGTAAA CTTGGCGCCT CTATCAATAA AACCAAAACA 
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ATTGCCATTT TTATCAACCA ATTGCGTGAA AAAGTTGGAG TGATGTTTGG AAATCCAGAA 
ACAACACCGG GCGGACGTGC TTTGAAATTC TATGCTTCAG TCCGCTTGGA TGTTCGTGGT 
AATACACAAA TTAAGGGAAC TGGTGATCAA AAAGAAACCA ATGTCGGTAA AGAAACTAAG 
ATTAAGGTTG TAAAAAATAA GGTAGCTCCA CCGTTTAAGG AAGCCGTAGT TGAAATTATG 
TACGGAGAAG GAATTTCTAA GACTGGTGAG CTTTTGAAGA TTGCAAGCGA TTTGGATATT 
ATCAAAAAAG CAGGGGCTTG GTATTCTTAC AAAGATGAAA AAATTGGGCA AGGTTCTGAG 
AATGCTAAGA AATACTTGGC AGAGCACCCA GAAATCTTTG ATGAAATTGA TAAGCAAGTC 
CGTTCTAAAT TTGGCTTGAT TGATGGAGAA GAAGTTTCAG AACAAGATAC TGAAAACAAA 
AAAGATGAGC CAAAGAAAGA AGAAGCAGTG AATGAAGAAG TTCCGCTTGA CTTAGGCGAT 
GAACTTGAAA TCGAAATTGA AGAATAAGCT GTTAAAGCAG TGGAGAAATC CGCTACTTTT 
TCGA 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATGCGTTCAG GAAGTCTAGC TCTTGATATT GCTTGGATAG CTGGTGGTTA TCCTAAAGGA 

CGTATCATCG AAATCTATGG TCCAGAGTCT TCCGGTAAAA CGACTGTGGC TTTACATGCT 

GTAGCACAAG CTCAAAAAGA AGGTGGAATC GCAGCCTTTA TCGATGCCGA GCATGCGCTT 

GATCCAGCTT ATGCTGCTGC GCTTGGGGTT AATATTGATG AACTTCTCTT GTCTCAACCA 

GATTCTGGAG AACAAGGACT TGAAATTGCA GGTAAATTGA TTGATTCTGG TGCGGTTGAC 

CTGGTTGTTG TCGATTCAGT AGCAGCTTTA GTGCCACGTG CTGAAATTGA TGGTGATATT 

GGCGATAGCC ATGTCGGATT GCAAGCACGT ATGATGAGTC AGGCCATGCG TAAATTATCA 

GCTTCTATTA ATAAAACAAA AACTATCGCA ATCTTTATCA ACCAATTGCG TGAAAAAGTT 

GGTGTGATGT TTGGAAATCC TGAAACAACA CCAGGTGGTC GAGCTTTGAA ATTCTATGCT 

TCTGTTCGGC TGGATGTGCG TGGAAACAAC CAAATTAAAG GAACTGGTGA CCAAAAGATA 
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GCCAGCATTG GTAAGGAGAC CAAAATCAAG GTTGTTAAAA ACAAGGTCGC TCCGCCATTT 
AAGGTAGCAG AAGTTGAAAT CATGTATGGG GAAGGTATTT CTCGTACAGG GGAGCTTGTG 
AAAATTGCTT CTGATTTGGA CATTATCCAA AAAGCAGGTG CTTGGTTCTC TTATAATGGT 
GAGAAGATTG GCCAAGGTTC TGAAAATGCT AAGCGTTATT TGGCCGATCA TCCACAATTG 
TTTGATGAAA TCGACCGTAA AGTACGTGTT AAATTTGGTT TGCTTGAAGA AAGCGAAGAA 
GAATCTGCTA TGGCAGTAGC ATCAGAAGAA ACCGATGATC TTGCTTTAGA TTTAGATAAT 
GGTATTGAAA TTGAAGATTA A 
(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CTATGTGGCG CGGTATTATC 

(2) INFORMATION FOR SEQ ID NO: 38: 
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GCGTATGCAC GAGCTCTAGG TGTTAATATC GATGAGCTTC TTTTGTCGCA 


GCCTGATTCT 


60 


GGTGAGCAAG GTCTCGAAAT TGCAGGTAAG CTGATTGACT CTGGTGCAGT 


GGATTTAGTT 


120 


GTTGTTGACT CAGTTGCGGC CTTCGTACCA CGTGCAGAAA TTGATGGAGA 


TAGTGGTGAC 


180 


AGTCATGTAG GACTTCAAGC GCGTATGATG AGTCAAGCCA TGCGTAAACT 


TTCTGCATCT 


240 


ATTAATAAAA CAAAAACGAT TGCTATCTTT ATTAACCAGT TGCGTGAAAA 


AGTTGGTATC 


300 


ATGTTTGGTA AC 






(2) INFORMATION FOR SEQ ID NO: 37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CGCAGTGTTA TCACTCATGG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 39: 

CTGAATGAAG CCATACCAAA 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

ATCAGCAATA AACCAGCCAG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

20 

TTACCATGAG CGATAAC AG C 
(2) INFORMATION FOR SEQ ID NO: 42: 
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(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 20 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTCATTCAGT TCCGTTTCCC 

(2) INFORMATION FOR SEQ ID NO: 4 3 . 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 s 

CAGCTGCTGC AGTGGATGGT 

(2) INFORMATION FOR SEQ ID NO: 44 . 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44; 

CGCTCTGCTT TGTTATTCGG 

(2) INFORMATION FOR SEQ ID NO: 45 . 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45; 

TACGCCAACA TCGTGGAAAG 

(2) INFORMATION FOR SEQ ID NO: 46: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

20 

TTGAATTTGG CTTCTTCGGT 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

20 

GGGATACAGA AACGGGACAT 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

20 

TAAATCTTTT TCAGGCAGCG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pair3 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

2 C 

GATGGTTTGA AGGGTTTATT ATAAG 
(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AATTTAGTGT GTTTAGAATG GTGAT 
(2) INFORMATION FOR SEQ ID NO: 51 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 - 
ACTTCAACAC CTGCTGCTTT C 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 52: 
TGACCACTTT TATCAGCAAC C 
(2) INFORMATION FOR SEQ ID NO: 53 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGCAATAGTT GAAATGCTCG 
(2) INFORMATION FOR SEQ ID NO: 54 : 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

20 

CAGCTGTTAC AACGGACTGG 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

20 

TCTATGATCT CGCAGTCTCC 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
ATCGTCACCG TAATCTGCTT 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CATTCTCGAT TGCTTTGCTA 
(2) INFORMATION FOR SEQ ID NO: 58: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CCGAAATGCT TCTCAAGATA 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 - 

CTGGATTATG GCTACGGAGT 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
AGCAGTGTGA TGGTATCCAG 
(2) INFORMATION FOR SEQ ID NO: 61: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

GACTCTTGAT GAAGTGCTGG 

(2) INFORMATION FOR SEQ ID NO: 62: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTGGTCTATT CCTCGCACTC 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TATGAGAAGG CAGGATTCGT 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

20 

GCTTTCTCTC GAAGGCTTGT 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

20 

GAGTTGCTGT TCAATGATCC 
(2) INFORMATION FOR SEQ ID NO: 66: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GTGTTTGAAC CATGTACACG 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TGTAGAGGTC TAGCCCGTGT 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

ACGGGGATAA CGACTGTATG 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

ATAAAGATGA TAGGCCGGTG 

(2) INFORMATION FOR SEQ ID NO: 70: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

20 

TGCTGTCATA TTGTCTTGCC 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

20 

ATTATCTTCG GCGGTTGCTC 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

20 

GACTATCGGC TTCCCATTCC 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

20 

CGATAGAAGC AGCAGGACAA 
(2) INFORMATION FOR SEQ ID NO: 74: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

CTGATGGATG CGGAAGATAC 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

GCCTTATGTA TGAACAAATG G 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii); MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GTGACTTTWG TGATCCCTTT TGA 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

TCCAATCATT GCACAAAATC 

(2) INFORMATION FOR SEQ ID NO: 78: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
AATTCCCTCT ATTTGGTGGT 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TCCCAAGCCA GTAAAGCTAA 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) .MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

TGGTTTTTCA ACTTCTTCCA 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 81: 

20 

TCATAGAATG GATGG CTCAA 
(2) INFORMATION FOR SEQ ID NO: 82: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

AGCTACTATT GCACCATCCC 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CAATAAGGGC ATACCAAAAA TC 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTTAACATT TGTGGCATTA TC 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TTGGGAAGAT GAAGTTTTTA GA 

(2) INFORMATION FOR SEQ ID NO: 86: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

22 

CCTTTACTCC AATAATTTGG CT 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 87: 

20 

TTTCATCTAT TCAGGATGGG 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

20 

GGAGCAACAT TCTTTGTGAC 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

20 

TGTGCCTGAA GAAGGTATTG 
(2) INFORMATION FOR SEQ ID NO: 90: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

CGTGTTACTT CACCACCACT 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TATCTTATCG TTGAGAAGGG ATT 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
CTACACTTGG CTTAGGATGA AA 
(2) INFORMATION FOR SEQ ID NO; 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
CTATCTGATT GTTGAAGAAG GATT 
(2) INFORMATION FOR SEQ ID NO: 94: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GTTTACTCTT GGTTTAGGAT GAAA 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CTTGTTGATC ACGATAATTT CC 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

ATCTTTTAGC AAACCCGTAT TC 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

27 

AACAGGTGAA TTATTAG C AC TTGTAAG 
(2) INFORMATION FOR SEQ ID. NO: 98: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

ATTGCTGTTA ATATTTTTTG AGTTGAA 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GTGATCGAAA TCCAGATCC 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

ATCCTCGGTT TTCTGGAAG 

(2) INFORMATION FOR SEQ ID NO: 101: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTCATAC ATGTGATGG 

(2) INFORMATION FOR SEQ ID NO: 102: 
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(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 102: 
GATGTTACCC GAGAGCTTG 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
( Xi , SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TTAAGCGTGC ATAATAAGCC 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
TTGCGATTAC TTCGCCAACT 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
TTTACTAAGC TTGCCCCTTC 
(2) INFORMATION FOR SEQ ID NO: 106: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE : DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
AAAAGGCAGC AATTATGAGC 

20 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 9 

(D) OTHER INFORMATION : /note= »n = inosine" 
(ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note= »n =, inosine" 
(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note* - n = inosine" 
(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 16 

■-■(D) OTHER INFORMATION: /note- »n - inosine" 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /not "n = inosine" 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
AAYATGATNA CNGGNGCNGC NCARATGGA 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION : 3 

(D) OTHER INFORMATION: /note= "n = xnosine- 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
<B) LOCATION: 6 

(D) OTHER INFORMATION :/note= "n = mosine 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATIONS 

(D) OTHER INFORMATION: /note = »n = mosine" 

(ix) FEATURE: 

(A) NAME/KEY: miscj eature 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note- "n = mosine 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CCNACNGTNC KNCCRCCYTC RCG 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 6 

; (D) OTHER INFORMATION: /note* "n = IXlOSine 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note- "n = mosine 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note- »n = mosine 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
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(B) LOCATION; 18 

(D) OTHER INFORMATION : / note= »n = inosine" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CARYTNATHG TNGCNGTNAA YAARATGGA 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
ATGAAAAACA CAATACATAT CAACTTCGCT ATTTTTTTAA TAATTGCAAA TATTATCTAC 
AGCAGCGCCA GTGCATCAAC AGATATCTCT ACTGTTGCAT CTCCATTATT TGAAGGAACT 
GAAGGTTGTT TTTTACTTTA CGATGCATCC ACAAACGCTG AAATTGCTCA ATTCAATAAA 
GCAAAGTGTG CAACGCAAAT GGCACCAGAT TCAACTTTCA AGATCGCATT ATCACTTATG 
GCATTTGATG CGGAAATAAT AGATCAGAAA ACCATATTCA AATGGGATAA AACCCCCAAA 
GGAATGGAGA TCTGGAACAG CAATCATACA CCAAAGACGT GGATGCAATT TTCTGTTGTT 
TGGGTTTCGC AAGAAATAAC CCAAAAAATT AGATTAAATA AAATCAAGAA TTATCTCAAA 
GATTTTGATT ATGGAAATCA AGACTTCTCT GGAGATAAAG AAAGAAACAA CGGATTAACA 
GAAGCATGGC T CGAAAG TAG CTTAAAAATT TCACCAGAAG AACAAATTCA ATTCCTGCGT 
AAAATTATTA ATCACAATCT CCCAGTTAAA AACTCAGCCA TAGAAAACAC CATAGAGAAC 
ATGTATCTAC AAGATCTGGA TAATAGTACA AAACTGTATG GGAAAACTGG TGCAGGATTC 
ACAGCAAATA GAACCTTACA AAACGGATGG TTTGAAGGGT TTATTATAAG CAAATCAGGA 
CATAAATATG TTTTTGTGTC CGCACTTACA GGAAACTTGG GGTCGAATTT AACATCAAGC 
ATAAAAGCCA AGAAAAATGC GATCACCATT CTAAACACAC TAAATTTATA A 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TTGAAAAAGT TAATATTTTT AATTGTAATT GCTTTAGTTT TAAGTGCATG TAATTCAAAC 
AGTTCACATG CCAAAGAGTT AAATGATTTA GAAAAAAAAT ATAATGCTCA TATTGGTGTT 
TATGCTTTAG ATACTAAAAG TGGTAAGGAA GTAAAATTTA ATTCAGATAA GAGATTTGCC 
TATGCTTCAA CTTCAAAAGC GATAAATAGT GCTATTTTGT TAGAACAAGT ACCTTATAAT 
AAGTTAAATA AAAAAGTACA TATTAACAAA GATGATATAG TTGCTTATTC TCCTATTTTA 
GAAAAATATG TAGGAAAAGA TATCACTTTA AAAGCACTTA TTGAGGCTTC AATGACATAT 
AGTGATAATA CAGCAAACAA TAAAATTATA AAAGAAATCG GTGGAATCAA AAAAGTTAAA 
CAACGTCTAA AAGAACTAGG AGATAAAGTA ACAAATCCAG TTAGATATGA GATAGAATTA 
AATTACTATT CACCAAAGAG CAAAAAAGAT ACTTCAACAC CTGCTGCTTT CGGTAAGACT 
TTAAATAAAC TTATCGCAAA TGGAAAATTA AGCAAAGAAA ACAAAAAATT CTTACTTGAT 
TTAATGTTAA ATAATAAAAG CGGAGATACT TTAATTAAAG ACGGTGTTCC AAAAGACTAT 
AAGGTTGCTG ATAAAAGTGG TCAAGCAATA ACATATGCTT CTAGAAATGA TGTTGCTTTT 
GTTTATCCTA AGGGCCAATC TGAACCTATT GTTTTAGTCA TTTTTACGAA TAAAGACAAT 
AAAAGTGATA AGCCAAATGA TAAGTTGATA AGTGAAACCG CCAAGAGTGT AATGAAGGAA 
TTTTAA 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
ATGTCCGCGA GCACCCCCCC CATAACTCTT CGCCTCATGA CCGAGCGCGA CCTGCCGATG 
CTCCATGACT GGCTCAACCG GCCGCACATC GTTGAGTGGT GGGGTGGCGA CGAAGAGCGA 
CCGACTCTTG ATGAAGTGCT GGAACACTAC CTGCCCAGAG CGATGGCGGA AGAGTCCGTA 
ACACCGTACA TCGCAATG CT GGGCGAGGAA CCGATCGGCT ATGCTCAGTC GTACGTCGCG 
CTCGGAAGCG GTGATGGCTG GTGGGAAGAT GAAACTGATC CAGGAGTGCG AGGAATAGAC 
CAGTCTCTGG CTGACCCGAC ACAGTTGAAC AAAGGCCTAG GAACAAGGCT TGTCCGCGCT 
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™*C TflCTGTTCTC GGAGCCCACC CGACGAAGA „^ k ^ 

aacaaccatc coerce ctgc™gag aaggcagga. TCOTOa3<wj mmKm: 

AGCAGGCCTG ACGGCCCCGC GGTTTACATG „»« ^ 
CGCGGTGTTG CCTAA 

(2) INFORMATION FOR SEQ ID NO: 113 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 732 base pairs 
<B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: U 3 . 
ATGAACCAGA AAAACCC.AA AGACACGCAA 

GAAATATTGA ATCACACGAA TATCAGTAAA CAAGACAACG TAATAGAAAT CGGATCAGGA 
AAAGGACATT TTACCAAAQA GCTAGTCAAA A^AGTCGA, , mmmc 
GATGGAGGCT TATOTCAAGT GACTAAAGAA GCGGTAAACC CCTCTGAGAA TATAAAAGTG 
ATTCAAACGG ATATTCTAAA A^C^C CCAAAACATA TAAACTATAA GATATATGGT 
AATATTCCTT ATAACATCAG TACGGATATT GTCAAAAGAA TTACCTTTCA AAGTCAGGCT 
AAA^TAGCT ATCTTATCGT TGAGAAGGGA TTTGCGAAAA GATTGCAAAA TCTGCAACGA 
GCTFTGGGTT TACTATTAAT GGTCGAGATG GATATAAAAA TGCTCAAAAA AGTACCACCA 
CTATATTTTC ATCCTAAGCC AAG^TAGAG TCTGTATTGA TTGTTCTTGA ACGACATCAA 
CCATTGATTT CAAAGAAGGA CTACAAAAAG TATCGATC1T TTGITTATAA G^AAAC 
CGTGAATATC GTGTTCTTTT CACTAAAAAC CAATTCCGAC AGGCTTTGAA GCATGCAAAT 
GTCACTAATA TTAATAAACT ATCGAAGGAA CAATTTCTTT CTATTTTCAA TAGTTACAAA 
TTGTTTCACT AA 

(2) INFORMATION FOR SEQ ID NO: 11 4 . 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
ATGAACAAAA ATATAAAATA TTCTCAAAAC TTTTTAACGA GTGAAAAAGT ACTCAACCAA 
ATAATAAAAC AATTGAATTT AAAAGAAACC GATACCGTTT ACGAAATTGG AACAGGTAAA 
GGGCATTTAA CGACGAAACT GGCTAAAATA AGTAAACAGG TAACGTCTAT TGAATTAGAC 
AGTCATCTAT TCAACTTATC GTCAGAAAAA TTAAAATCGA ATACTCGTGT CACTTTAATT 
CACCAAGATA TTCTACAGTT TCAATTCCCT AACAAACAGA GGTATAAAAT TGTTGGGAAT 
ATTCCTTACC ATTTAAGCAC ACAAATTATT AAAAAAGTGG TTTTTGAAAG CCATGCGTCT 
GACATCTATC TGATTGTTGA AGAAGGATTC TACAAGCGTA CCTTGGATAT TCACCGAACA 
CTAGGGTTGC TCTTGCACAC TCAAGTCTCG ATTCAGCAAT TGCTTAAGCT GCCAGCGGAA 
TGCTTTCATC CTAAACCAAG AGTAAACAGT GTCTTAATAA AACTTACCCG CCATACCACA 
GATGTTCCAG ATAAATATTG GAAGCTATAT ACGTACTTTG TTTCAAAATG GGTCAATCGA 
GAATATCGTC AACTGTTTAC TAAAAATCAG TTTCATCAAG CAATGAAACA CGCCAAAGTA 
AACAATTTAA GTACCGTTAC TXATGAGCAA GTATTGTCTA TTTTTAATAG TTATCTATTA 
TTTAACGGGA GGAAATAA 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
ATGAACGAGA AAAATATAAA ACACAGTCAA AACTTTATTA CTTCAAAACA TAATATAGAT 
AAAATAATGA CAAATATAAG ATTAAATGAA CATGATAATA TCTTTGAAAT CGGCTCAGGA 
AAAGGGCATT TTACCCTTGA ATTAGTACAG AGGTGTAATT TCGTAACTGC CATTGAAATA 
GACCATAAAT TATGCAAAAC TACAGAAAAT AAACTTGTTG ATCACGATAA TTTCCAAGTT 
TTAAACAAGG ATATATTGCA GTTTAAATTT CCTAAAAACC AATCCTATAA AATATTTGGT 
AATATACCTT ATAACATAAG TACGGATATA ATACGCAAAA TTGTTTTTGA TAGTATAGCT 
GATGAGATTT ATTTAATCGT GGAATACGGG TTTGCTAAAA GATTATTAAA TACAAAACGC 
TCATTGGCAT TATTTTTAAT GGCAGAAGTT GATATTTCTA TATTAAGTAT GGTTCCAAGA 
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GAATAXTXXC ATCCTAAACC XAGAGXGAAT AGCTCACTTA TCAGATTAAA TAGAAAAAAA 
TCAAGAATAT CACACAAAGA TAAACAGAAG TATAATTATT TCGTTATGAA ATGGGTTAAC 
AAAGAATACA AGAAAATATT TACAAAAAAT CAATTTAACA ATTCCTTAAA ACATGCAGGA 
ATTGACGATT TAAACAATAT TAGCTTTGAA CAATTCTTAT CTCTTTTCAA TAGCTATAAA 
TTATTTAATA AGTAA 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS : 

CA) LENGTH: 1029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
ATGAATAAAA TAAAAGTCGC AATTATCTTC GGCGGTTGCT CGGAGGAACA TGATGTGTCG 
GTAAAATCCG CAATAGAAAT TGCTGCGAAC ATTAATACTG AAAAATTCGA TCCGCACTAC 
ATCGGAATTA CAAAAAACGG CGTATGGAAG CTATGCAAGA AGCCATGTAC GGAATGGGAA 
GCCGATAGTC TCCCCGCCAT AXXCXCCCCG GATAGGAAAA CGCATGGTCT GCTTGTCATG 
AAAGAAAGAG AATACGAAAC XCGGCGTATT GACGTGGCTT XCCCGGXTIT GCATGGCAAA 
TGCGGGGAGG ATGGTGCGAT ACAGGGTCTG TTTGAATTGT CTGGTAXCCC CXAXGXAGGC 
XGCGAXAXXC AAAGCXCCGC AGCXXGCAXG GACAAAXCAC XGGCCXACAT XCXXACAAAA 
AAXGCGGGCA XCGCCGXCCC CGAAXTXCAA AXGAXTGAAA AAGGXGACAA ACCGGAGGCG 
AGGACGCXXA CCXACCCXGX CXXXGXGAAG CCGGCACGGX CAGGTXCGTC CXXXGGCGXA 
ACCAAAGXAA ACAGXACGGA AGAACXAAAC GCXGCGAXAG AAGCAGCAGG ACAAXATGAT 
GGAAAAAXCX XAAXXGAGCA AGCGAIITCG GGCXGTGAGG XCGGCXGCGC GGXCAXGGGA 
AACGAGGAXG ATTIGAXXGX CGGCGAAGXG GAXCAAAXCC GGIXGAGCCA CGGTAXCTXC 
CGCAXCCAXC AGGAAAACGA GCCGGAAAAA GGCXCAGAGA AXGCGAXGAX XAXCGXTCCA 
GCAGACAXTC CGGXCGAGGA ACGAAAXCGG GXGCAAGAAA CGGCAAAGAA AGXAXATCGG 
GXGCXXGGAX GCAGAGGGCT XGCXCGXGXX GAXCTTXXXX XGCAGGAGGA XGGCGGCAXC 
GXXCXAAACG AGGXCAAXAC CCXGCCCGGX X^ACAXCGX ACAGCCGCXA XCCACGCATG 
GCGGCTGCCG CAGGAAXCAC GCXXCCCGCA CTAATXGACA GCCXGAXXAC AXXGGCGATA 
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GAGAGGTGA 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT GTCACTAACC 
TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT ATGAAGTAAT GACCATTGGC 
ATCGCACCAA CAATGGATTG GTATTGGTAT CAAGGAAACC TCGCGAATGT TCGCAATGAT 
ACTTGGCTAG AAGATCACAA AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA 
TTAGGAGAAA AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 
GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT TGGTTGCCAT 
GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC ATCAACTTGC TGATACCATG 
GGAATCGCTA GTGCTCCCAC TTTGCTTTTA TCCCGCTATG AAAACGATCC TGCCACAATC 
GATCGTTTTA TTCAAGACCA TGGATTCCCG ATCTTTATCA AGCCGAATGA AGCCGGTTCT 
TCAAAAGGGA TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 
TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA AATTGGCTGC 
GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG ATGCGATTTC TCTTGTCGAC 
GGTTTTTTTG ATTTTGAAGA GAAATACCAA TTAATCAGCG CCACGATCAC TGTCCCAGCA 
CCATTGCCTC TCGCGCTTGA ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC 
TTGGGATTGA CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 
TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC AGCTATGATG 
GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT TGATTGCACT GGCAGAGGAG 
GACAAACGAT G 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
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(D) topology; linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
TGGTGCTATC TTAGTAGTAT CTGCAGCTGA TGGTCCAATG CCTCAAACAC GTGAACACAT 
CTTATTATCA CGTCAAGTAG GTGTTCCTTA CATCGTTGTA TTCTTAAACA AAGTTGACAT 
GGTTGACGAT GAAGAATTAT TAGAATTAGT AGAAATGGAA GTTCGTGACT TATTATCAGA 
ATACGATTTC CCAGGCGATG ACACTCCAGT TGTTGCAGGT TC^CXTTAC GCGCTTTAGA 
AGGCGACGCT TCATACRAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ACGAATACAT 
TCCAACTCCA GAACGVGACG TTGACAAACC ATTCATGATG CCAGTTGAAG ACGTGTTCTC 
AATCACAGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGTGTTGG 
TGACGAAGTT GAAATCGTTG GTATTTCAGA AGAAACTTCA AAAACAACTG TAACTGGTGT 
TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCAGGG GATAACATTG GTACATTATT 
ACGTGGTGTT ACACGTQACA ACATCGAACG TGGACAAGTT C^CTAAAC CAGGAACAAT 
CACTCCACAT ACTAAATTCA AAGCTGAAGT TTACGTATTA ACTAAAGAAG AAGGTGGACG 
TCATACTCCA TTCTTCTCTA ACTACCGTCC TCAATTCTAC TTCCGTACAA CAGACATCAC 
TGGTGTTTGT GTGTTACCAG AAGGCGTTGA AATGGTAATG CCTGGTGATA ACGTAACTAT 
GGAAGTTGAA TTAATTCACC CAGTAGCGA 
(2) INFORMATION FOR SEQ ID NO: 119 . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Abiotrophia defectiva 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 9: 
COOC^TC CTCGTTCTAT CTGCTGCTGA CGCCCCAATG CCACAAACTC ^ 
CCTCTTCTCT OGTCAAGTTG GTGTTCCTTA CATCGTAGTA TTCTTGAACA AAGTTGACAT 120 
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GGTTGACGAC GAAGAATTGC TCGAATTAGT TGAAATGGAA GTTCGTGACC TCTTGTCTGA 
ATACGACTTC CCAGGCGACG ACACTCCAGT TATCGCTGGT TCAGCTTTGA AAGCTTTAGA 
AGGCGACGCT AACTACGAAG CTAAAGTTTT AGAATTGATG GAACAAGTTG ATGCTTACAT 
TCCAGAACCA GAACGTGACA CTGACAAGCC ATTCATGATG CCAGTCGAAG ACGTATTCTC 
TATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA CGTGGTCAAG TTCGCGTTGG 
TGACGAAGTT GAAATCGTTG GTATCGAAGA AGAAACTTCT AAGACTACCG TTACCGGTGT 
TGAAATGTTC CGTAAGTTAT TGGATTACGC TGAAGCTGGG GACAACGTTG GTACCTTGTT 
ACGTGGTGTA ACTCGTGACC AAATCCAACG TGGTCAAGTA TTATCTAAAC CAGGTTCAAT 
CACTCCGYAC ACTAAGTTCG AAGCTGAAGT GTACGTATTG TCTAAAGAAG AAGGTGGTCG 
TCACACTCCA TTCTTCTCTA ACTACCGTCC ACAATTCTAC TTCCGTACAA CTGACGTAAC 
TGGTGTTGTT ACTTTACCAG AAGGTACTGA AATGGTTATG CCAGGCGACA ACGTACAAAT 
GGTTGTTGAA TTGATCCACC CAATCGCGAT CGAAGAA 

(2) INFORMATION FOR SEQ ID NO: 120: , 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CTCTGTCAAA TGGGACAAAA ACAGATTTGA AGAAATCATC AAGGAAACCT CCAACTTCGT 

CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG GTTGGAATGG 

TGACAACWTG ATTGAASCAT CCACCAACTG TCCATGGTAC AAGGGTTGGG AAAAGGAAAC 

CAAATCCGGT AAAGTTACTG GTAAGACCTT GTTAGAAGCT ATTGACGCTA TTGAACCACC 

AACCAGACCA ACCGACAAAC CATTGAGATT GCCATTRCAA GATGTTTACA AGATCGGTGG 

TATTGGTACT GTGCCAGTCG GTAGAGTTGA AACTGGTATC ATCAAAGCCG GTATGGTWGT 

TACTTTCGCC CCAGCTGGTG TTACCACTGA AGTCAARTCC GTTGAAATGC ATCACGAACA 

ATTGGCTGAA GGTGTTCCAG GTGACAATGT TRGTTTCAAC GTTAAGAACR TTTCCGTTAA 

AGAAATTAGA AGAGGTAACG TTTGTGGTGA CTCCAAGAAC GATCCACCAA AGGGTTGTGA 
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CTCTTTCAAT GCCCAAGTCA TTGTTTTGAA CCATCCAGGT CAAATCTCTG CTGGTTACTC 
TCCAGTCTTG GATTGTCACR CTGCCCACAT TGCTTGTAAA ^CGACHCTX TGGTTGAAAA 
GATTGACAGA AGAACTGGTA AGRAATTGGA AGAAAATCCA AAATTCGTCA AATCCGGTGA 
TGCTGCTATC GTCAAGATGG TCCCAACCAA ACCA 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 
(O STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

TCTGTCAAGT GGGATGAATC CAGATTCGCT GAAATCGTTA AGGAAACCTC CAACTTCATC 

AAGAAGGTCG GTTACAACCC AAAGACTGTT CCA^CGTCC CAATCTCTGG TTGGAACGGT 

GACAACATGA TTGAAGCCAC CACCAACGCT TCCTGGTACA AGGGTTGGGA AAAGGAAACC 

AAGGCTGGTG TCGTCAAGGG TAAGACCTTG TTGGAAGCCA TTGACGCTAT CGAACCACCA 

ACCAGACCAA CTGACAAGCC ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT 

ATCGGTACGG TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 

ACCTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAGTCCG TTGAAATGCA CCACGAACAA 

TTGACTGAAG GTTTGCCAGG TGACAACGTT GGTTTCAACG TTAAGAACGT TTCCGTTAAG 

GAAATCAGAA GAGGTAATGT CTGTGGTGAC TCCAAGAACG ACCCACCAAA GGCTGCTGCT 

TCTTTCAACG CTACCGTCAT TGTCTTGAAC CACCOVGGTC AAATCTCTGC TGGTTACTCT 

ccag™ actgtcacac cgcccacatt gcttgtaagt tcgaagaatt gttggaaaag 

AACGACAGAA GATCCGGTAA GAAGTTGGAA GACTCTCCAA AGTTCTTGAA GTCCGGTGAC 
GCTGCTTTGG TTAAGTTCGT TCCATCCAAG CCA 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
CCGTTAAGTG GGATGAAAAC AGATTTGAAG AAATTGTCAA GGAAACCCAA AACTTCATCA 
AGAAGGTTGG TTACAACCCA AAGACTGTTC CATTCGTTCC AATCTCTGGT TGGAATGGTG 
ACAACATGAT TGAAGCATCC ACCAACTGTC CATGGTACAA GGGTTGGACT AAGGAAACCA 
AGGCAGGTGT TGTTAAGGGT AAGACCTTAT TAGAAGCAAT CGATGCTATT GAACCACCTG 
TCAGACCAAC CGAAAAGCCA TTAAGATTAC CATTACAAGA TGTTTACAAG ATTGGTGGTA 
TTGGTACTGT GCCAGTCGGT AGAGTCGAAA CCGGTGTCAT TAAGCCAGGT ATGGTTGTCA 
CTTTTGCTCC AGCAGGTGTC ACCACCGAAG TCAAATCCGT TGAAATGCAC CATGAACAAT 
TAGAACAAGG TGTTCCAGGT GATAACGTTG GTTTCAACGT TAAGAACGTY TCTGTCAAGG 
ATATCAAGAG AGGTAACGTT TGTGGTGACT CCAAGAACGA CCCACCAATG GGTGCAGCTT 
CTTTCAATGC TCAAGTCATT GTCTTGAACC ACCCTGGTCA AATTTCCGCT GGTTACTCTC 
CAGTCTTGGA TTGTCACACT GCCCACATTG CATGTAAGTT CGACGAATTA ATCGAAAAGA 
TTGACAGAAG AACTGGTAAG TCTGTTGAAG ACCATCCAAA GTCYGTCAAG TCTGGTGATG 
CAGCTATCGT CAAGATGGTC CCAACCAAGC CA 
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsxlosas 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 123: 

CTCAGTCAAA TGGGACAAGA RCAGATACGA AGAAATTGTC AAGGAAACTT CCAACTTCGT 

CAAGAAGGTT GGTTACAACC CTAAAGCTGT CCCATTCGTC CCAATCTCTG GTTGGAACGG 

TGACAATATG ATTGAACCAT CAACCAACTG TCCATGGTAC AAGGGTTGGG AAAAGGAAAC 

TAAAGCTGGT AAGGTTACCG GTAAGACCTT GTTGGAAGCT ATCGATGCTA TCGARCCACC 
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-CCAGACCA ACTGACAAGC CATTGAGATT GCCATTGCAA GATGTCTACA AGATTGGTGG 
TATTGGAACT GTGCCAGTTG GTAGAGTTGA AACCGGTATC ATCAAGGCTG GTATGGTTGT 
TACTTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAGTCC GTTGAAATGC ACCACGAACA 
ATTGACTGAA GGTGTCCCAG GTGACAATGT TGGTTTCAAC GTCAAGAACG TTTCAGTTAA 
GGAAATCAGA AGAGGTAACG TYTGTGGTGA CTCCAAGAAC GATCCACCAA AGGGATGTGA 
YTCCTTCAAT GCTCAAGTTA TTGTCTTGAA CCACCCAGGT CAAATCTCTG CTGGTTACTC 

ACCAGTCTTG GATTGTCACA CTOCCCACAT TGCTTGTAAA TTCGACACTT TGATTGAAAA 

GATTGACAGA AGAACCGGTA AGAAATTGGA AGWTGAACCA AAATTCATCA AGTCCGGTGA 

TGCTGCYATC GTCAAGATGG TCCCAACCAA GCCA 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Candida tropicalis 

UL) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

TCTGTTAAAT GGGACAARAA CAGATTTGAA GAAATTATCA AGGAAACYTC TAACTTCGTC 

AAGAAGGTTG GTTACAACCC TAAGGCTGTT CCATTCGTTC CAATCTCWGG TTGGAATGGT 

GACAACATGA TTGAAGCTTC TACCAACTGT CCATGGTACA AGGGTTGGGA AAAAGAAACC 

AAGGCTGGTA AGGTTACCGG TAAGACTTTG TTGGAAGCCA TTGATGCTAT TGAACCACCT 

TCAAGACCAA CTGACAAGCC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATTGGTGGT 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTGTCA TCAAAGCCGG TATGGTTGTT 

ACTTTYGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG TYGAAATGCA CCACGAACAA 

TTGGCTGAAG GTGTCCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT TTCTGTTAAA 

GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC 

TCTTTCAACG CTCAAGTTAT TGTCTTGAAC CACCCAGGTC AAATYTCTGC TGGTTACTCT 

CCAGTCTTGG ATTGTCACAC TGCTCATATT GCTTGTAAAT TCGACACCTT GGTTGAAAAG 

ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA AATTCGTCAA ATCCGGTGAT 
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GCTGCTATTG TCAAGATGGT TCCAACCAAA CCA 
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebac cerium accolens 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 125: 
CGGCGCTATC CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC GCGAGCACGT 
TCTGCTTGCT CGCCAGGTTG GCGTTCCTTA CATCCTCGTT GCACTGAACA AGTGCGACAT 
GGTTGATGAT GAGGAAATCA TCGAGCTCGT GGAGATGGAG ATCTCCGAGC TGCTCGCAGA 
GCAGGACTAC GATGAGGAAG CTCCTATCGT TCACATCTCC GCTCTGAAGG CACTCGAGGG 
TGACGAGAAG TGGGTACAGT CCATCGTTGA CCTGATGGAT GCCTGCGACA ACTCCATCCC 
TGATCCGGAG CGCGCTACCG ATCAGCCGTT CTTGATGCCT ATCGAGGACA TCTTCACCAT 
TACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT GGTCGTCTGA ACGTCAACGA 
GGACGTTGAG ATCATCGGTA TCCAGGAGAA GTCCCAGAAC ACCACCGrTA CCGGTATCGA 
GATGTTCCGC AAGATGATGG ACT AC AC CG A GGCTGGCGAC AACTGTGGTC TGCTTCTGCG 
TGGTACCAAG CGTGAGGACG TTGAGCGTGG CCAGGTTGTT ATCAAGCCGG GCGCTTACAC 
CCCTCACACC AAGTTCGAGG GTTCCGTCTA CGTCCTGAAG AAGGAAGAGG GCGGCCGCCA 
CACCCCGYTC ATGAACAACT ACCGTCCTCA GTTCTACTTC CGCACCACCG ACGTTACCGG 
TGTTGTGAAC CTGCCTGAGG GCACCGAGAT GGTTATGCCT GGCGACAACG TTGAGATGTC 
TGTTGAGCTC ATCCAGCCTG TTGCTATGGA CGAG 
(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM : Corynebacterium diphteriae 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

CGGCGCAATC ctcgttgttg ctgccaccga cggcccaatg cctcagaccc gtgagcacgt 

TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT GCTCTGAACA AGTGCGACAT 

GGTTGATGAT gaggaaatca TCGAGCTCGT cgagatggag atccrtgagc tgctcgctga 

GCAGGATTAC GACGAAGAGG CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG 
CGACGAGAAG TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 
AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA TCTTCACCAT 
CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT GGCTCCCTGA AGGTCAACGA 
GGACGTCGAG ATCATCGGTA TCCGCGAGAA KGCTACCACC ACCACCGTTA CCGGTATCGA 
GATGTTCCGT AAGCTTCTCG ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG 
TGGCGTTAAG CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 
CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG GTGGCCGCCA 

CAccccArrc ttcgacaact accgcccaca GTTCTACTTC CGCACCACCG acgttaccgg 

TGTTGTGAAG CTTCCTGAGG GCACCGAGAT GGTCATGCCT GGCGACAACG TCGACATGTC 

cgtcaccctg atccagcctg tcgctatgga tgag 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Cozynebacterium genitalia 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
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GGATCCGGAG CGCGAGACCG ACAAGCCGTT CCTGATGCCG GTTGRGGACA TCTTCACCAT 
XACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT GGCGTCCTGA ACCTGAACGA 
CGAGGTCGAG ATCCTGGGCA TCCGCGAGAA GTCCACCAAG ACCACCGTTA CCTCCATCGA 
GATGTTCAAC AAGCTGCTGG ACACCGCAGA GGCTGGCGAC AACGCCGCAC TGCTGCTGCG 
TGGCCTGAAG CGCGAAGATG TTGAGCGTGG TCAGATCGTT GCTAAGCCGG GCGAGTACAC 
CCCGCACACC GAGTTCGAGG GCTCCGTCTA CGTTCTGTCC AAGGACGAGG GTGGCCGCCA 
CACCCCGTTC TTCGACAACT ACCGTCCGCA GTTCTATTTC CGCACCACCG ACGTTACCGG 
TGTTGTGAAG CTGCCGGAGG GCACCGAGAT GGTTATGCCG GGCGACAACG TTGACATGTC 
CGTCACCCTG ATCCAGCCGG TTGCTATGGA CGAG 
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: ..^w,,-,,.- 
(A) ORGANISM: corymebactenxm jeikexm 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
CGGCGCCATC CTGGTTGTTG CCGCAACCGA TGGCCCGATG CCGCAGACCC GCGAGCACGT 
TCTGCTGGCY CGCCAGGTTG GCGTTCCGTA CATCCTGGTT GCACTGAACA AGTGTGACAT 
GGTTGACGAT GAGGAGCTGC TGGAGCTCGT CGAGATGGAG GTCCGCGAGC TGCTGGCTGA 
GCAGGACTTC GACGAGGAAG CTCCGGTTGT TCACATCTCC GCACTGAAGG CCCTGGAGGG 
CGACGAGAAG TGGGCTAACC AGATTCTCGA GCTGATGCAG GCTTGCGACG AGTCTATCCC 
GGATCCGGAG CGCGAGACCG ACAAGCCGTT CCTGATGCCG GTTGWGGACA TCTTCACCAT 
TACCGGTCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT GGCATCCTGA ACCTGAACGA 
CGAGGTTGAG ATCCTGGGTA TCCGCGAGAA GTCCCAGAAG ACCACCGTTA CCTCCATCGA 
GATGTTCAAC AAGCTGCTGG ACACCGCAGA GGCTGGCRAC AACGCTGCAC TGCTGCTGCG 
TGGTCTGAAG CGCGAGGACG TTGAGCGTGG CCAGATCATC GCTAAGCCGG GCGAGTACAC 
CCCGCACACC GAGTTCGAGG GCTCCGTCTA CGTTCTGTCC AAGGACGAGG GCGGCCGCCA 
CACCCCGTTC TTCGACAACT ACCGTCCGCA GTTCTACTTC CGCACCACCG ACGTTACCGG 
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<-«». C^CCGAGG GCACCGAGAT ^ 
CGTCACCCTG ATCCAGCCGG TTGCTATGGA CGAG 
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE • 

(») GBGAMSH, Co^,«« rJ „ p« U[tedifJ , terltic „ 

(xi) SIQUEKX DESCRIPTION: SEO ID HO: m , 
CGGCGGTATC TIGGTTGTTG CAGCTACCGA CGGOXAAIG CCACAGACTC GCGAGCACGT 
TGTGCTGGCT CCCCAGG™ GCGI.CC™ ^ 
GGTTGACGAC GAGGAAATCC rcGAGCICGT CCAGAI^ iI0CGCM „ 
CCAGCAAITC CACCAACAAC CTCCAAT03T rCACAICTCC GCAGTCGGCG CCTOgm<b 

<*»™™ «™»»T GCTTGTGACG AGTCCATCCC 
TCATCCAGAC CGTGCTACCG ACAAGCCATT CCTGATGCCT ATCGAGGACA IGTTCACCAT 
TACCGGTCGT GGCACCGTIG TTACGGGTCG TGTTGAGCGT GGTTCCCTGA AGGTCAACGA 
AGAAGTCGAG ATCATGGGCA TCAAGGAAAA GTCGCAGAAG ACCACCATCA COGGTATCGA 
AATGTTCCGC AAGATGCTGG ACTACACCGA GGGCGGGGAC AACGCTGGTC TGCTGCTTCG 
CQGTACCAAG CGTGAAGACG TTGAGCGTGG ACAGGTTATC G7TGCTCCAG GTGCTTACAG 
CACCCACAAG AAGTTCGAAG GTTCCGTCTA CGHCITTCC AAGGACGAGG GCGGCCGCCA 

caccccg™ ttccacaaci accgtcctca gttctacttc cggacgaccc acgxtaccgg 

TGTTGTTACC CTGCCTGAGG GCACCGAG 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH : 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE: DNA (genomic) - 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Corynebacterium striatum 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
GGCGCTATCT TGGTTGTTGC TGCAACCGAT GGCCCGRTGC CGCAGACCCG CGAGCACGTT 
CTTCTGGCTC GCCAGGTTGG CGTTCCTTAC ATCCTCGTTG CACTGAACAA GTGCGACATG 
GTTGACGACG AGGAAATTAT CGAGCTCGTC GAGATGGAGA TCCGCGAACT GCTCGCAGAG 
CAGGACTACG ATGAGGAAGC TCCGATCGTT CACATCTCTG CTCTGAAGGC TCTTGAGGGC 
GRCGAGAAGT GGGTACAGGC TATCGTTGAC CTGATGCAGG CTTGCGATGA CTCCATCCCG 
GATCCGGAGC GCGAGCTGGA CAAGCCGTTC CTGATGCCAA TCGAGGACAT CTTCACCATC 
ACCGGCCGCG GTACCGTTGT TACTGGCCGT GTTGAGCGTG GCTCCCTGAA CGTCAACGAG 
GACGTTGAGA TCATCGGTAT CCAGGACARG TCCATCTCCA CCACCGTTAC CGGTATCGAG 
ATGYTCCGCA AGATGATGGA CTACACCGAG GCTGGCGACA ACTGTGGTCT GCTTCTGCGT 
GGTACCAAGC GTGAAGAGGT TGAGCGCGGC CAGGTTGTTA TTAAGCCGGG CGCTTACACC 
CCTCACACCC AGTTCGAGGG TTCCGTCTAC GTCCTGAAGA AGGAAGAGGG CGGCCGCCAC 
ACCCCGTTCA TGGACAACTA CCGTCCGCAG TTCTACTTCC GCACCACCGA CGTTACCGGC 
GTCATCAAGC TGCCTGAGGG CACCGAGATG GTTATGCCTG GCGACAACGT CGAGATGTCY 
GTCGAGCTGA TCCAGCCGGT CGCTATGGAC GAG 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(Xi) - SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACTC GTGAACACAT 

CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGTTGTA TTCTTAAACA AAATGGATAT 

GGTTGACGAT GAAGAATTAC TTGAATTAGT TGAAATGGAA GTTCGTGACT TATTAACTGA 

ATACGACTTC CCAGGCGACG ACACTCCAGT TATCGCAGGT TCAGCGTTGA AAGCTTTAGA 

AGGCGACGCT TCATACGAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ACGAATATAT 
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CCCAACACCA GTTCGTGATA CXGACAAACC ATTCATGATG CCAGTCGAAG ACGTATTCTC 
AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG 
TGACGAAGTT GAAATCGTAG GTATCGCTGA CGAAACTGCT AAAACAACTG TTACAGGTGT 
TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCAGGT GACAACATCG GTGCTTTGTT 
ACGTGGTGTT GCACGTGAAG ATATCCAACG TGGACAAGTA TTGGCTAAAC CAGCTTCAAT 
CACTCCACAX ACAAAATTCT CTGCAGAAGT TTATGTTCTA ACTAAAGAAG AAGGTGGACG 
TCATACTCCA TTCTTCACTA ACXACCGTCC TCAGTTCTAC XTCCGTACAA CTGACGTAAC 
TGGTGTAGTT GATCTACCAG AAGGTACTGA AATGGTWATG CCTGGGGATA ACGTAACTAT 
GGAAGTTGAA TTGATYCACC CAATYGCGGT AGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterocoacus fae Ca li s 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC GTGAACATAT 

CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA TTCTTAAACA AAATGGATAT 

GGTTGATGAC GAAGAATTAT TAGAATTAGT AGAAATGGAA GTTCGTGACT TATTATCAGA 

ATACGATTTC CCAGGCGATG ATGTTCCAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA 

AGGCGACGAG TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 

CCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG ACGTATTCTC 

AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA CGTGGTGAAG TTCGCGTTGG 

TGACGAAGTT GAAATCGTTG GTATTAAAGA CGAAACATCT AAAACAACYG TTACAGGTGT 

TGAAATGTTC CGTAAATTAT TAGACTACGC TGAAGCAGGC GACAACMTCG GTGCTTTATT 

ACGTGGTGTA GCACGTGAAG ATATCGAACG TGGACAAGTA TTAGCTAAAC CAGCTACAAT 

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG AAGGCGGACG 

TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC TTCCGTACAA CAGACGTTAC 
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TGGTGTTGTA GAATTGCCAG AAGGTACTGA AATGGTAATG CCTGGTGATA ACGTTGCTAT 
GGACGTTGAA TTAATTCACC CAATCGCTAT CGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 133: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecivm 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAAI* CCTCAAACTC GTGAACACAT 
CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT 
GGTTGATGAC GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 
ATACRAATTC CCTGGTGRCG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA 
AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 
CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG ACGTGTTCTC 
AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG 
TGACGAAGTT GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 
TGAAATGTTC CGTAAATTGT TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT 
ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 
CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG AAGGTGGACG 
TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC TTCCGTACAA CTGACGTAAC 
AGGTGTTGTT GAATTACCAG AAGGAACTGA AATGGTCATG CCCGGTGACA ACGT 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 809 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Enterococcus 9 alli narum 
Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 134 . 
CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC GTGAACACAT 60 
CTTGTTATCA CGTAACGTTG GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT 
GGTTGAYGAC GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 
ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAG CTCTTGA 
AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 
TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG ACGTATTCTC 
AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG 
TGATGAAGTA GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 
TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT 
ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACAAT 
CACACCTCAT ACAAAAXTCA AAGCTGAAGT TTATGTTTXG ACAAAAGAAG AAGGXGGACG 
TCACACTCCA TTCTTCACTA ACXACCGTCC TCAGTTCTAC TTCCGTACAA CTGACGTAAC 
TGGTGTTGTT GAATXACCAG AAGGAACTGA AATGGTGATG CCTGGCGACA ACGTGACCAT 
CGACGTTGAA TTGATRCACC CAATCGCTC 
(2) INFORMATION FOR SEQ ID NO: 135 : 

(U SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) organism: Gardnerella va^inalia 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 : 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

809 



TGGCGCAATC 


CTCGTGGTTG 


CTGCTACCGA 


CGGTCCAATG 


GCTCAGACCC 


GTGAACACGT 


60 


CTTGCTTGCT 


AAGCAGGTCG 


GCGTTCCAAA 


AATTCTTGTT 


GCTTTGAACA 


AGTGCGATAT 


120 


GGTTGACGAC 


GAAGAGCTTA 


TCGATCTCGT 


TGAAGAAGAG 


GTCCGTGACC 


TCCTCGAAGA 


180 


AAACGGCTTC 


GATCGCGATT 


GCCCAGTCYT 


CCGTACTTCC 


GCTTACGGCG 


CTTTGCATGA 


240 


TGACGCTCCA 


GACCACGACA 


AGTGGGTAGA 


GACCGTCAAG 


GAACTCATGA 


AGGCTGTTGA 


300 



SUBSTITUTE SHEET (RULE 26) 



PCT/CA97/00829 - 

WO 98/20157 



126 



CGAGTACATC CCAACCCCAA CTCACGATCT ******* TTCrTGATGC CAATCGAAGA 
^TTCACC ATCTCCGGTC GTGGTYCCGT TGTCACCGGT CGTGTTGAGC GTGGTAAGCT 
CCCAATCAAC ACCCCAGTTG AGATCGTTGG TTTGCGCGAT ACCCAGACCA CCACCGTCAC 
CTCTATCGAG ACCTTCCACA AGCAGATGGA TGAGGCAGAG GCTGGCGATA ACACTGGTCT 
TCTTCTCCGC GGTATCAACC GTACCGACGT TGAGCGTGGT CAGGTTGTGG CTGCTCCAGG 
TTCTGTGACT CCACACACCA AGTTCGAAGG CGAAGTTTAC GTCTTGACCA AGGACGAAGG 
TGGC CGTCAC TCGCCATTCT TCTCCAACTA CCGTCCACAG TTCTACTTCC GTACCACCGA 
TGTTACTGGC GTTATCACCT TGCCAGACGG CATCGAAATG GTTCAGCCAG GCGATCACGC 
AACCTTCACT GTTGAGTTGA TCCAGGCTAT CGCAATGGAA GAG 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria innocua 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC GTGAACATAT 

CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA TTCATGAACA AATGTGACAT 

GGTTGACGAT GAAGAATTAC TAGAATTAGT TGAAATGGAA ATTCGTGATC TATTAACTGA 

ATATGAATTC CCTGGCGATG ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA 

AGGTGAAGCT GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 

TC CAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG ATGTATTCTC 

AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 

TGACGAAGTA GAAGTTATCG GTATTGAAGA AGAAAGCAAA AAAGTAGTAG TAACTGGAGT 

AGAAATGTTC CGTAAATTAC TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT 

ACGTGGTGTT GCTCGTGAAG ATATCCAACG TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 

T ACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG AAGGTGGACG 

TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT TTCCGTACTA CTGACGTAAC 
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TGGTATTGTT ACACTTCCAG AAGGTACTGA AATGGTAATG CCTGGTGATA ACATTGAGCT 
TGCAGTTGAA CTAATTGCAC CAATCGCTAT CGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 137 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 : 
CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC GTGAACATAT 
TCTTACI^C ACGTCAAGTT GGTGTTCCAT ACATCGTTGT ATTCATGAAC AAATGTGACA 
TGGTTGACGA TGAAGAATTA CTTGAATTAG TTGAAATGGA AATTCGTGAT CTATTAACTG 
AATATGAATT CCCTGGCGAC GACATTCCTG TAATCAAAGG TTCAGCTCTT AAAGCACTTC 
AAGGTGAAGC TGATTGGGAA GCTAAAATTG ACGAGTTAAT GGAAGCTGTA GATTCTTACA 
rrCCAACTCC AGAACGTGAT AC^ACAAAC CATTCATGAT GCCAGTTGAG GATGTATTCT 
CAATCACTGG TCGTGGAACA GTTGCAACTG GACGTGTTGA ACGTGGACAA GTTAAAGTTG 
GTGACGAAGT AGAAGTTATC GGTATTGAAG AAGAAAGCAA AAAAGTAGTA GTAACTGGAG 
TAGAAATGTT CCGTAAATTA CTAGACTACG CTGAAGCTGG CGACAACATT GGCGCACTTC 
TACGTGGTGT TGCTCGTGAA GATATCCAAC GTGGTCAAGT ATTAGCTAAA CCAGGTTCGA 
-ACTCCACA TACTAACTTC AAAGCTGAAA CTTATGTTTT AACTAAAGAA GAAGGTGGAC 
GTCATACTCC ATTCTTCAAC AACTACCGCC CACAATTCTA TTTCCGTACT ACTGACGXAA 
CTGGTATTGT TACACTTCCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACATTGAGC 
TTGCAGTTGA ACTAATTGCA CCAATCGCTA TCGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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fv-n ORIGINAL SOURCE: 

(A) ORGANISM : Listeria monocytogenes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC GTGAACATAT 
CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA TTCATGAACA AATGTGACAT 
GGTTGACGAT GAAGAATTAC TAGAATTAGT TGAAATGGAA ATTCGTGATC TATTAACTGA 
AT ATGAATTC CCTGGCGATG ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA 
AGGTGAAGCT GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 
TC CAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG ATGTATTCTC 
AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 
TGACGAAGTA GAAGTTATCG GTATCGAAGA AGAAAGCAAA AAAGTAGTAG TAACTGGAGT 
AGAAATGTTC CGTAAATTAC TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT 
ACGTGGTGTT GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 
TA CTCCACAC ACTAACTTCA AAGCXGAAAC TTATGTTTTA ACTAAAGAAG AAGGTGGACG 
TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT TTCCGTACTA CTGACGTAAC 
TGGTATTGTT ACACTTCCAG AAGGTACTGA AATGGTAAYG CCTGGTGATA ACATTGAGCT 
TGCAGTTGAA CTAATTGCAC CAATCGCTAT CGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria seeligen 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC GTGAACATAT 

CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA TTCATGAACA AATGTGACAT 

' GGTTGACGAT GAAGAATTAC TTGAATTAGT TGAAATGGAA ATTCGTGATC TATTAACTGA 

ATATGAATTC CCTGGTGATG ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA 
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VOXDDVDDXD 0WDDV3VW voxvxxvxoo vxvxowovo omou^o xovxvovoov 

0YXXW3XX0 OX00X03XOO VXXVXSWDX ODXDDWOVX 03VDWDXDD XOOXXEMOOX 
oovxxvxxvo cxooxxvaw ovoxoaioov VDXOD.VXDV OVXXVXXVW XOOOXXDXW 
VDXXOXODVD VXXOXOWOV WVXOXVDVO VOXV.VXXXO SOXVOXVWO xxowewox 
ODXXOVW.X VWOXOaXOO WDXXE3XOOD DDVDVXOOXX DXOVXODXDO xeoxovoxw 
OIOXXVXOOV 30V0XXSV30 DXVDXVOXXV OOWVOVOXO XXVOXOOWS VOOXOWODX 
XV.VXX.VXV OVXOXOOWO OXHVXXWOV XXOXWWW 0W03VXW0 XOOXVOOOOV 

vcvxxxoow hvxxvosvox xodxoooxw xoxoovxdxv dovox DS voo nxxovoxvxv 
VSODVVXXVX X3VDX00XXD woaLvwev xovxxwovx xvxxwswo XVS0V0XX09 
XV3V9XXDW VOVWXXOXX VXOVXOVXXV OOVOOVXOXO O^o^OO VDXVXXXXDX 
XVOVOWOXO OXOVWOVOD ^WOOXSOO V^OOXOOX. XVXOVXOVXX oxvxooxooo 

: on : 0N ai oas ^oixaiaosaa a.NanOas (xx> 

snMW snooooorXtpJeas --wsinvdho (v) 

: aoanos tvnioiho (xa) 

P-CU10U36) VNQ : 3dAJ> gr^a^ (TT) 

^ooroaox (a) 
siqnop : ssaNaaamraxs (o) 

Ptob oxa T onu : adA1 (a) 
SJXPd 3SBq frT8 : H1DK37 <Y) 

: SDixsiaaxovHVHO aoNanoas en 

: 0*T :ON ai 03S HO* NOIXVWHOJNI [Z) 
OVOWSO XVXDDOXWD OVDDXXWXO WOXXOVDDX 

XOOV^XXVOV VXVOXODXOO oxwxodxw voxovxoow ovooudvov xxsxxvxsox 

OWX^VOXO VXOVXODDXX XVXOXXWOV OOOOOOVXOV VOWOXXOXX V30X0V0V0X 
odydsxsow OWOVWXOV VXXXXDXVXX OWVOXOOW TOi XOWXOV XV3V00X0VX 
XV03XX00V0 DWVXOOVXX VXOWDXOOX OOWOOXVXV DWSXSOXO, XXOXO^OOV 
XOXXOVOODD OLXVOWOVO OOOXODWSX OO.VXOVOVX OVXXVWXOD oxxoxvwov 
XOVOOXOWX DVXWXSVW WVDOVW3V vowaxxvxo ooxvxxowo vxowoavox 
OOXXOVWXX 0W0V30X0D WSXXSXDOV 0DX0W30XX OXOW S OXDO XODXOVOXW 
OXOXXVXOXV SSVOXXOVOO OXVOXVOXXV ODVWOVOXO VXVDXOOWS VOOXOWOOX 
XV3VXX0XXV SVXOXOOWO 0XWXX0V03 VOXXWWXO OW^OXOVS XOOWCXSDV 
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CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC CGTACTACTG ACGTAACTGG 
TGTTGTTCAC TTACCAGAAG GTACTGAAAT GGTAATGCCT GGTGATAACG TTGAAATGAC 
AGTAGAATTA ATCGCTCCAA TCGCGATTGA AGAC 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: staphylococcus epidermdis 

( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC GTGAACACAT 
CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA TTCTTAAACA AAGTTGACAT 
GGTAGACGAC GAAGAATTAT TAGAATTAGT TGAAATGGAA GTTCGTGACT TATTAAGCGA 
ATATGACTTC CCAGGTGACG ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA 
AGGCGATGCT GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 
TC CAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG ACGTATTCTC 
AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA CGTGGTCAAA TCAAAGTWGG 
TGAAGAAGTT GAAATCATCG GTATGCACGA AACTTCTAAA ACAACTGTTA CTGGTGTAGA 
AATGTTCCGT AAATTATTAG ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG 
TGGTGTTGCA CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 
AC CACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG GTGGACGTCA 
CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC CRTACTACTG ACGTAACTGG 
TGTTGTAAAC TTACCAGAAG GTACAGAAAT GGTTATGCCT GGCGACAACG TTGAAATGAC 
RGTTGAATTA ATCGCTCCAA TCGCTATCGA AGAC 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCB: 

(A) ORGANISM : Staphylococcus saprophytics 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 : 
CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC GTGAACACAT 
TCTTTTATCA CGTRACGTTG GTGVTCCAGC ATTAGTTGTA TTCTTAAACA AAGTTGACAT 
GGTTGACGAY GAAGAATTAT TAGAATTRGT AGAAATGGAA GTTCGTGRCT TATTAAGCGA 
ATATGACTTC CCAGGTGACG ATGTACCTGT AATCTCTGGT TCTGCATTAA AAGCTTTAGA 
AGGCGACGCT GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTYCAT 
TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG ACGTATTCTC 
AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA CGTGGTCAAA TCAAAGTCGG 
TGAAGAAATC GARATCATCG GTATGCAAGA AGAATCAAGC AAAACAACTG TTACTGGTGT 
AGAAATGTTC CGTAAATTAT TAGACTACGC TGAAGCTGGT GACAACATTG GTGCATTATT 
ACGTGGTGTT TCACGTGATG ATGTACAACG TGGTCAAGTT TTAGCTGCTC CTGGTACTAT 
CACACCACAT ACAAAATTCA AAGCGGATGT TTACGTTTTA TCTAAAGATG AAGGTGGTCG 
TCATACGCCA TTCTTCACTA ACTACCGCCC ACAATTCTAT TTCCGTACTA CTGACGTAAC 
TGGTGTTGTT AACTTACCAG AAGGTACTGA AATGGTTATG CCTGGCGATA ACGTTGAAAT 
GGATGTTGAA TTAATTTCTC CAATCGCTAT TGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 143 . 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus S i mu l aBS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 . 

TTAGTAGTAT CTGCTOCAGA TGGTCCAATG CCACMACTC GTGAACACAT ,. 

CTTATTA^, CGTAACGTTG GTGTACCAGC TTTAGTTGTA TTCTTAAACA AAGCTGAC.T 12 „ 

GGTTGACGAC GAAGAATTAT TAGAATTAGT TCAAATGGAA GTTCGTGACT TATniTCTGA m 
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ATACGACTTC CCTGGTGACG ATGTACCAGT TATCGTTGGT TCTGCATTAA AAGCTTTAGA 
AGGCGACCCA GAAXACGAAC AAAAAATCTT AGACTTAATG CAAGCTGTAG ATGACTACAT 
CCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG ACGTATTCTC 
AATCACTGGT CGTGGTACTG TAGCAACAGG CCGTGTTGAA CGTGGTCAAA TCAAAGTCGG 
TGAAGAAGTT GAAATCATCG GTATCACTGA AGAAAGCAAG AAAACAACAG TTACAGGTGT 
AGAAATGTTC CGTAAATTAT TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT 
ACGTGGTGTT GCACGTGAAG ACGTACAACG TGGACAAGTA TTAGCAGCTC CTGGCTCTAT 
T ACTCCACAC ACAAAATTCA AAGCTGATGT TTACGTTTTA TCTAAAGAAG AAGGTGGACG 
TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC TTCCGTACTA CTGACGTAAC 
TGGCGTTGTT CACTTACCAG AAGGTACTGA AATGGTTATG CCTGGCGATA ACGTAGAAAT 
GACTGTTGAA TTGATCGCTC CAATCGCGAT TGAAGAC 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactaae 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC GTGAGCACAT 

CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA TTCATGAACA AAGTTGACCT 

TGTTGATGAT GAAGAATTGC TTGAATTGGT TGAAATGGAA ATTCGTGACC TTCTTTCAGA 

ATACGACTTC CCAGGTGATG ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA 

AGGCGACGAA AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATGAGTACAT 

XCCAGAACCA GAACGTGATA CTGACAAACC TTTACTTCTT CCAGTTGAAG ATGTATTCTC 

AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC CGTGGTACTG TTCGTGTCAA 

CGACGAAGTT GAAATCGTTG GTATTAAAGA AGATATCCAA AAAGCAGTTG TTACTGGTGT 

TGAAATGTTC CGTAAACAAC TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT 

TCGTGGTGTT CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 
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CAACCCACAC ACXAAAXTXA ^ 

TCATACTCCA XXCXXCAACA ACXACCCXCC ACAATTCTAC XXCCGXACAA CXGACGXAAC 
AGGXXCAAXC GAACXXCCAG CAGGAACAGA AAXGGXXAXG CCXGGXGAXA ACGXXACXAX 
CGAAGXXGAA XXGAXXCACC CAAXCGCCGX AGAACAA 
(2) INFORMAXION FOR SEQ ID NO: 145 . 

(i) SEQUENCE CHARACXERISXICS- 

(A) LENGXH: 817 base pairs 

(B) XYPE: nucleic acid 

(C) SXRANDEDNESS : double 

(D) XOPOLOGY: linear 

(ii) MOLECULE XYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Streptococcus pneumoniae 
(Xi) SEQUENCE DESCRIPXION: SEQ ID NO: M5l 
CGGAGCXAXC CXXGXAGXAG CXXCAACXGA CGGACCAAXG CCACAAACXC GXGAGCACAX 
CCXXCXXXCA CGXCAGGXXG GXGXXAAACA CCXXAXCGXC XXCAXGAACA AAGXXGACXX 
GGXXGACGAC GAAGAAXXGC XXGAAXIGGX XGAAAXGGAA AXCCGXGACC XAXXGXCAGA 
AXACGACXXC CCAGGXGACG AXCXXCCAGX XAXCCAAGGX XCAGCACXXA AAGCXCXXGA 
AGGXGACXCX AAAXACGAAG ACAXCGXXAX GGAAXXGAXG AACACAGXXG AXGAGXAXAX 
CCCAGAACCA GAACGXGACA CXGACAAACC AIXGCXXCXI CCAGXCGAGG ACGXAXXCXC 
AAICACXGGA CGXGGXACAG XXGCXXCAGG ACGXAXCGAC CGXGGXAXCG XXAAAGXCAA 
CGACGAAAXC GAAAXCGXXG GXAXCAAAGA AGAAACXCRA AAAGCAGXXG XXACXGGIGX 
XGAAAXGIXC CGXAAACAAC XXGACGAAGG XCXXGCXGGA GAXAACGXAG GXGXCCXXCX 
TCGXGGXGXX CAACGXGAXG AAAXCGAACG XGGACAAGXX AXCGCXAAAC CAGGXXCAAX 
CAACCCACAC ACXAAAXXCA AAGGXGAAGX CXACAXCCXX ACXAAAGAAG AAGGXGGACG 
TCACACXCCA XXCXXCAACA ACXACCGXCC ACAAXXCXAC XXCCGXACXA CXGACGXXAC 
AGGXXCAAXC GAACXXCCAG CAGGXACXGA AAXGGXAAXG CCXGGXGAXA ACGXGACAAX 
CGACGXXGAG XXGAXXCACC CAAXCGCCGX AGAACAA 
(2) INFORMAXION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACXERISXICS- 

(A) LENGXH: 817 base pairs 

(B) XYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ORIGINAL SOURCE: M 7,varius 
(A) ORGANISM: Streptococcus salivarius 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 146: 
CGGTGCGATC CTTGTAGTAG CATCTACTGA CGGACCAATG CCACAAACTC GTGAGCACAT 
CCTTCTTTCA CGTCAGGTTG GTGTTAAACA CCTTATCGTC TTCATGAACA AAGTTGACTT 
GGTTGACGAT GAAGAATTGC TTGAATTGGT TGAAATGGAA ATCCGTGACC TTCTTTCAGA 
ATACGATTTC CCAGGTGATG ACATTCCAGT TATCCAAGGT TCAGCTCTTA AAGCTCTTGA 
AGGTGATTCT AAATACGAAG ACATCATCAT GGACTTGATG AACACTCTTG ACGAATACAT 
CCCAGAACCA GAACGTGACA CTGACAAACC ATTGTTGCTT CCAGTCGAAG ACGTATTCTC 
AATCACTGGT CGTGGTACTG TTGCTTCAGG ACGTATCGAC CGTGGTGTTG TTCGTGTCAA 
TGACGAAGTT GAAATCGTTG GTCTTAAAGA AGACATCCAA AAAGCAGTTG TTACTGGTGT 
TGAAATGTTC CGTAAACAAC TTGACGRAGG TATTGCCGGA GATAACGTCG GTGTTCTTCT 
TCGTGGTATC CAACGTGATG AAATCGAACG TGGTCAAGTA TTGGCTGCAC CTGGTTCAAT 
CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACATCCTT TCTAAAGAAG AAGGTGGACG 
TC ACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC TTCCGTACAA CTGACGTAAC 
AGGTTCAATC GAACTTCCTG CAGGTACTGA AATGGTTATG CCTGGTGATA ACGTGACTAT 
CGACGTTGAG TTGATCCACC CAATCGCCGT TGAACAA 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
<xi) SE0UE»CE DESCEIPTIOM: SEQ ID M>, 147, 

cc^croc «««c TOmOTC 

«»»» C«™« GTCRGGTCGG CSTTCCCGCC 
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™ M «««« «— CC^c CGAGCTCGTC 
«™» TTCGGGAAC, TAC0ACITcc 

ATCAAGGGTT CGGCACTTGC TGCTCTTP a a 

TGCTCTTGAA GATTCTGACA AGAAGATCGG TGAAGACGCG 

ATCCGCGAGC TGATGGCTGC TCX CGACGCC TACATCCCGA CGCCTGAGCG TCCGATCGAC 
CAGCCGTTCC TGATGCCGAT CGAAGACGTG .CTCOATCT CGGGTCGTGG TACGGTTGTG 
ACGGGTCGCG TTGAGCGCGG TATCGTCAAG GTTGGTGAAG AAGTCGAAAT CGTCGGCATC 
-CCGACCT CGAAGACGAC TGTTACCGGC G TT GAAA TGT TCCCCAAGCT GCTCGACCAG 
OOCCAGGCCG GCGACAACAT CGGTGCACTC GTTCGCGGCG TTACCCGTGA CGGCGTCGAG 

CGTGGTCAGA TCCTGTGCAA GCCGGGTTCG GTCAAGCCGP *™„ 

TCAAGCCGC ACAAGAAGTT CATGGCAGAA 

«— CC «»„ AGAAGGCGGC 

GGGGAG^ ACTTCCGTAC GACTGACGTT ACCGGTATCG „™ TO AAGGCAGG 
GAAATGGTTA TGCCGGCG. CAACGTCACT GTTGAAGTCG AGCTCATCGT TCCGATCGCG 

ATGGAAGAAA AGCTGCGCTT CGCTATCCrr rn.r™, 

CGCTATCCGC GAAGGCGGCC GTACCGTCGG CGCCGGC 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH : 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Bacillus subtilis 

<*i) SEQUENCE DESCRIPTION: SEQ ID NO: 148 . 
ATGATCACTG GTGCTGCGCA AATGGACGGA GCTATCCTTG TAGTATCTGC TGCTGATGGC 
CCAATGCCAC AAACTCGTGA GCACATCCTT CTTTCTAAAA ACGTTGGTGT ACCATACATC 
«~ TAAACAAATG CGACATGGTA GACGACGAAG AGCTTCTTGA ACTAGTTGAA 
ATGGAAGTTC GCGATCTTCT TAGCGAATAC GACTTCCCTG GTGATGATGT ACCAOTTGTT 
AAAGGTTCTG CTCTTAAAGC TCTTGAAGGA GACGCTGAGT GGGAAGCTAA AATCTTCGAA 
CTTATGGATG CGGTTGATGA GTACATCCCA ACTCCAGAAC GCGACACTGA AAAACCATTC 
ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACAGTTGC TACTGGCCCT 
GTAGAACGCG GACAAGTTAA AGTCGGTGAC GAAGTTGAAA TCATCGGTCT TCAAGAAGAG 
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MmGWW C^ACTGTTAC AGGTGTTGAA ATGTTCCGTA AGCTTCTTGA TTACGCTGAA 
GCTGGTGACA ACATTGGTGC CCTTCTTCCC GGTGTATCTC GTGAAGAAAT CCAACGTGGT 
CAAGTACTTG CTAAACCAGG TACAATCACT CCACACAGCA AATTCAAAGC TGAAGTTTAC 
GTTCTTTCTA AAGAAGAGGG TGGACGTCAT ACTCCATTCT TCTCTAACTA CCGTCCTCAG 
TTCTACTTCC GTACAACTGA CGTAACTGGT ATCATCCATC TTCCAGAAGG CGTAGAAATG 
GTTATGCCTG GAGATAACAC TGAAATGAAC GTTGAACTTA TTTCTACAAT CGCTATCGAA 
GAAGGAACTC GTTTCTCTAT TCGTGAAGGC GGACGTACTG TTGGT 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroades fragzlis 

(xl ) SEQUENCE DESCRIPTION : SEQ ID NO: 149: 

ATGGTTACTG GTGCTGCTCA GATGGACGGT GCTATCATTG TAGTTGCTGC TACTGATGGT 

CCGATGCCTC AGACTCGTGA GCACATCCTT TTGGCTCGTC AGGTAAACGT TCCGAAGCTG 

GTTGTATTCA TGAACAAGTG CGATATGGTT GAAGATGCTG AGATGTTGGA ACTTGTTGAA 

ATGGAAATGA GAGAATTGCT TTCATTCTAT GATTTCGACG GTGACAATAC TCCGATCATT 

CAGGGTTCTG CTCTTGGTGC ATTGAACGGC GTAGAAAAAT GGGAAGACAA AGTAATGGAA 

CTGATGGAAG CTGTTGATAC TTGGATTCCA CTGCCTCCGC GCGATGTTGA TAAACCTTTC 

XTGATGCCGG TAGAAGACGT GTTCTCTATC ACAGGTCGTG GTACTGTAGC TACAGGTCGT 

ATCGAAACTG GTGTTATCCA TGTAGGTGAT GAAATCGAAA TCCTCGGTTT GGGTGAAGAT 

AAGAAATCAG TTGTAACAGG TGTTGAAATG TTCCGCAAAC TTCTGGATCA GGGTGAAGCT 

GGTGACAACG TAGGTCTGTT GCTTCGTGGT GTTGACAAGA ACGAAATCAA ACGTGGTATG 

GTTCTTTGTA AACCGGGTCA GATTAAACCT CACTCTAAAT TCAAAGCAGA GGTTTATATC 

CTGAAGAAAG AAGAAGGTGG TCGTCACACT CCATTCCATA ACAAATATCG TCCTCAGTTC 

TACCTGCGTA CTATGGACTG TACAGGTGAA ATCACTCTTC CGGAAGGAAC TGAAATGGTA 

ATGCCGGGTG ATAACGTAAC TATCACTGTA GAGTTGATCT ATCCGGTTGC ACTGAACATC 
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GGTCTTCGTT TCGCTATCCG CGAAGGTGGA CGTACAGTAG GT 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
AATATGATTA CAGGAGCAGC TCAAATGGAT GCAGCGATAC TTTTAGTTGC 
GGTGCTGAGC CTCAAACAAA AGAGCATTTG CTTCTTGCTC AAAGAATGGG 
ATAATAGTTT TTTTAAATAA ATTGGACTTA GCAGATCCTG AACTTGTTGA 
GTTGAAGTTT TAGAACTTGT TGAAAAATAT GGCTTTTCAG CTGATACTCC 
GGTTCAGCTT TTGGGGCTAT GTCAAATCCA GAAGATCCTG AATCTACAAA 
GAACTTCTTG AATCTATGGA TAATTATTTT GATCTTCCAG AAAGAGATAT 
TTTTTGCTTG CTGTTGAAGA TGTATTTTCT ATTTCAGGAA GAGGCACTGT 
CGTATTGAAA GAGGTATTAT TAAAGTTGGT CAAGAAGTTG AAATAGTTGG 
ACCAGAAAAA CTACTGTTAC TGGTGTTGAA ATGTTCCAGA AAATTCTTGA 
GCAGGGGATA ATGTTGGTCT TCTTTTGAGA GGCGTTGATA AAAAAGACAT 
CAAGTTTTGT CAGCTCCAGG TACAATTACT CCACAC*AGA AATTTAAAGC 
TGTTTGACTA AAGAAGAAGG CGGTAGGCAC AAGCCATTTT TCCCAGGGTA 
TTCTTTTTTA GAACAACCGA TGTTACTGGA GTTGTTGCTT TAGAGGGCAA 
ATGCCTGGTG ATAATGTTGA TATTATTGTT GAGCTGATCT CTTCAATAGC 
AATGTAGAAT TTGCTGTTCG AGAAGGTGGA AGAACCGTTG CTTCAGGA 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TGCTGATAGT 
AATAAAGAAA 
GCTTGTTGAA 
AATAATCAAA 
ATGCGTTAAA 
TGACAAGCCA 
TGCTACTGGG 
AATTAAAGAA 
GCAAGGTCAA 
TGAGAGGGGG 
TTCAATTTAT 
TAGACCACAG 
AGAAATGGTT 
TATGGATAAG 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevibacterium lxnens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
AACATGATCA CCGGTGCCGC TCAGATGGAC GGTGCGATCC TCGTCGTCGC CGCTACCGAC 
GGACCGATGC CCCAGACCCG TGAGCACGTG CTGCTCGCGC GTCAGGTCGG CGTTCCCTAC 
ATCGTCGTGG CTCTGAACAA GTCCGACATG GTCGATGACG AGGAGCTCCT CGAGCTCGTC 
GAATTCGAGG TCCGCGACCT GCTCTCGAGC CAGGACTTCG ACGGAGACAA CGCTCCGGTC 
ATTCCGGTGT CCGCTCTCAA GGCGCTGGAA GGCGACGAGA AGTGGGTCAA GAGCGTTCAG 
GATCTCATGG CTGCCGTCGA TGACAACGTT CCGGAGCCGG AGCGCGATGT CGACAAGCCG 
TTCCTCATGC CCGTCGAGGA CGTCTTCACG ATCACCGGTC GTGGAACCGT CGTCACCGGT 
CGTGTCGAGC GCGGCGTGCT CCTGCCTAAC GACGAAATCG AAATCGTCGG CATCAAGGAG 
AAGTCGTCCA AGACGACTGT CACCGCTATC GAGATGTTCC GCAAGACCCT GCCGGATGCC 
CGTGCAGGTG AGAACGTCGG TCTGCTCCTC CGCGGCACCA AGCGCGAGGA TGTTGAGCGC 
GGTCAGGTCA TCGTGAAGCC GGGTTCGATC ACCCCGCACA CCAAGTTCGA GGCTCAGGTC 
TACATCCTGA GCAAGGACGA GGGCGGACGT CACAACCCGT TCTACTCGAA CTACCGTCCG 
CAGTTCTACT TCCGGACCAC GGACGTCACC GGTGTCATCA CGCTGCCCGA GGGCACCGAG 
ATGGTCATGC CCGGCGACAA CACCGATATG TCGGTCGAGC TCATCCAGCC GATCGCTATG 
GAGGACCGCC TCCGCTTCGC AATCCGCGAA GGTGGCCGCA CCGTCGGCGC CGGT 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: BurJcholderia cepacia 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

ATGATCACGG GCGCAGCGCA GATGGACGGC GCGATCCTGG TTTGCTCGGC AGCAGACGGC 

CCGATGCCGC AAACGCGTGA GCACATCCTG CTGGCGCGTC AGGTTGGTGT TCCGTACATC 

ATCGTGTTCC TGAACAAGTG CGACAGTGTG GACGACGCTG AACTGCTCGA GCTGGTCGAG 
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ATGGAAGTTC GCGAACTCCT GTCGAAGTAC GACTTCCCGG GCGACGACAC GCCGATCGTG 
AAGGGTTCGG CCAAGCTGGC GCTGGAAGGC GACACGGGCG AGCTGGGCGA AGTGGCGATC 
ATGAGCCTGG CAGACGCGCT GGACACGTAC ATCCCGACGC CGGAGCGTGC AGTTGACGGC 
GCGTTCCTGA TGCCGGTGGA AGACGTGTTC TCGATCTCGG GCCGTGGTAC GGTGGTGACG 
GGTCGTGTCG AGCGCGGCAT CGTGAAGGTC GGCGAAGAAA TCGAAATCGT CGGTATCAAG 
CCGACGGTGA AGACGACCTG CACGGGCGTT GAAATGTTCC GCAAGCTGCT GGACCAAGGT 
CAGGCAGGCG ACAACGTCGG TATCCTGCTG CGCGGCACGA AGCGTGAAGA CGTGGAGCGT 
GGCCAGGTTC TGGCGAAGCC GGGTTCGATC ACGCCGCACA CGCACTTCAC GGCTGAAGTG 
TACGTGCTGA GCAAGGACGA AGGCGGCCGT CACACGCCGT TCTTCAACAA CTACCGTCCG 
CAGTTCTACT TCCGTACGAC GGACGTGACG GGCTCGATCG AGCTGCCGAA GGACAAGGAA 
ATGGTGATGC CGGGCGACAA CGTGTCGATC ACGGTGAAGC TGATTGCTCC GATCGCGATG 
GAAGAAGGTC TGCGCTTCGC AATCCGTGAA GGCGGCCGTA CGGTCGGC 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE : 

<A) ORGANISM: Chlamydia trachomatis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 : 

AACATGATCA CCGGTGCGGC TCAAATGGAC GGGGCTATTC TAGTAGTTTC TGCAACAGAC 

GGAGCTATGC CTCAAACTAA AGAGCATATT CTTTTGGCAA GACAAGTTGG GGTTCCTTAC 

ATCGTTGTTT TTCTCAATAA AATTGACATG ATTTCCGAAG AAGACGCTGA ATTGGTCGAC 

TTGGTTGAGA TGGAGTTGGC TGAGCTTCTT GAAGAGAAAG GATACAAAGG GTGTCCAATC 

ATCAGAGGTT CTGCTCTGAA AGCTTTGGAA GGAGATGCTG CATACATAGA GAAAGTTCGA 

GAGCTAATGC AAGCCGTCGA TGATAATATC CCTACTCCAG AAAGAGAAAT TGACAAGCCT 

TTCTTAATGC CTATTGAGGA CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA 

CGTATTGAGC GTGGAATTGT TAAAGTTTCC GATAAAGTTC AGTTGGTCGG TCTTAGAGAT 

ACTAAAGAAA CGATTGTTAC TGGGGTTGAA ATGTTCAGAA AAGAACTCCC AGAAGGTCGT 
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GCAGGAGAGA ACGTTGGATT GCTCCTCAGA GGTATTGGTA AGAACGATGT GGAAAGAGGA 
ATGGTTGTTT GCTTGCCAAA CAGTGTTAAA CCTCATACAC AGTTTAAGTG TGCTGTTTAC 
GTTCTGCAAA AAGAAGAAGG TGGACGACAT AAGCCTTTCT TCACAGGATA TAGACCTCAA 
TTCTTCTTCC GTACAACAGA CGTTACAGGT GTGGTAACTC TGCCTGAGGG AGTTGAGATG 
GTCATGCCTG GGGATAACGT TGAGTTTGAA GTGCAATTGA TTAGCCCTGT GGCTTTAGAA 
GAAGGTATGA GATTTGCGAT TCGTGAAGGT GGTCGTACAA TCGGTGCTGG A 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coll 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AACATGATCA CCGGTGCTGC GCAGATGGAC GGCGCGATCC TGGTAGTTGC TGCGACTGAC 

GGCCCGATGC CGCAGACTCG TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC 

ATCATCGTGT TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC GAAAATCCTG 

GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG AGCGTGCGAT TGACAAGCCG 

TTCCTGCTGC CGATCGAAGA CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT 

CGTGTAGAAC GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT CGAACGTGGT 

CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA AGTTCGAATC TGAAGTGTAC 

ATTCTGTCCA AAGATGAAGG CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG 

TTCTACTTCC GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC 

GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG TTGGCGCGGG C 
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(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 
(V) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : FiJbroJbacter succinogenes 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 155 : 
AACATGGTGA CTGGTGCTGC TCAGATGGAC GGCGCTATCC TCGTTGTTGC CGCTACTGAC 
«™»™c C^CCTC C^CTC^C ACCRGGTTCG OTOc™, 

ATCGTCGTGT TCATGAACAA GTGCGACATG GTTGACGATG CTGAAATTCT CGACCTCGTC 
CAAATGGAAG TTCGCGAACT CCTCTCCAAG TATGACTTCG ACGGTGACAA CACCCCGATC 
ATCCGTGGTT CCGCTCTCAA GGCCCTCGAA GGCGATCCGG AATACCAGGA CAAGGTCATG 
OAACTCATGA ACGCTTGCGA CGAATACATC CCGCTCCCGC AGCGCGATAC CGACAAGCCG 
TTCCTCATGC CGATCGAAGA CGTGTTCACG ATTACTGGCC GCGGCACTGT CGCTACTGGC 
CGTATCGAAC GCGGTGTCGT TCGCTTGAAC GACAAGGTTG AACGTATCGG TCTCGGTGAA 
ACCACCGAAT ACGTCATCAC CGGTGTTGAA ATGTTCCGTA AGCTCCTCGA CGACGCTCAG 
CCAGGTGACA ACGTTGGTCT CCTCCTCCGT GGTGCTGAAA AGAAGGACAT CGTCCGTGGC 
ATGGTTCTCG CAGCTCCGAA GTCTGTCACT CCGCACACCG AATTTAAGGC TGAAATCTAC 
OTTCTCACGA AGGACGAAGG TGGCCGTCAC ACGCCGTTCA TGAATGGCTA CCGTCCGCAG 
TTCTAC^CC GCACC*CCGA CGTTACTGGT ACGATCCAGC TCCCGGAAGG TGTCGAAATG 
GTTACTCCGG GTGACACGGT CACGATCCAC GTGAACCTCA TCGCTCCGAT CGCTATGGAA 
AAGCAGCTCC GCTTCGCTAT CCGTGAAGGT GGACGTACTG TTGGTGCTGG C 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Flavobacterivm ferrugineum 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
AACATGATCA CCGGTGCTGC CCAGATGGAC GGTGCTATCT TAGTTGTGGC TGCATCAGAC 
GGTCCTATGC CTCAAACAAA AGAACACATC CTGCTTGCTG CCCAGGTAGG TGTACCTAAA 
ATGGTTGTGT TTCTGAATAA AGTTGACCTC GTTGACGACG AAGAGCTCCT GGAGCTGGTT 
GAGATCGAGG TTCGCGAAGA ACTGACTAAA CGCGGTTTCG ACGGCGACAA CACTCCAATC 
ATCAAAGGTT CCGCTACAGG CGCCCTCGCT GGTGAAGAAA AGTGGGTTAA AGAAATTGAA 
AACCTGATGG ACGCTGTTGA CAGCTACATC CCACTGCCTC CTCGTCCGGT TGATCTGCCG 
TTCCTGATGA GCGTAGAGGA CGTATTCTCT ATCACTGGTC GTGGTACTGT TGCTACCGGT 
CGTATCGAGC GTGGCCGTAT CAAAGTTGGT GAGCCTGTTG AGATCGTAGG TCTGCAGGAG 
TCTCCCCTGA ACTCTACCGT TACAGGTGTT GAGATGTTCC GCAAACTCCT CGACGAAGGT 
GAAGCTGGTG ATAACGCCGG TCTCCTCCTC CGTGGTGTTG AAAAAACACA GATCCGTCGC 
GGTATGGTAA TCGTTAAACC CGGTTCCATC ACTCCGCACA CGGACTTCAA AGGCGAAGTT 
TACGTACTGA GCAAAGACGA AGGTGGCCGT CACACTCCAT TCTTCAACAA ATACCGXCCT 
CAATTCTACT TCCGTACAAC TGACGTTACA GGTGAAGTAG AACTGAACGC AGGAACAGAA 
ATGGTTATGC CTGGTGATAA CACCAACCTG ACCGTTAAAC TGATCCAACC GATCGCTATG 
GAAAAAGGTC TGAAATTCGC GATCCGCGAA GGTGGCCGTA CCGTAGGTGC AGGA 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC AGCAACAGAT 

GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC GCCAAGTAGG TGTTCCATAC 

ATCATCGTAT TCTTAAACAA ATGCGACATG GTAGATGACG AAGAGTTATT AGAATTAGTC 

GAAATGGAAG TTCGTGAACT TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC 
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GTACGTGGTT AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT TCACCAACCG 
TTCCTTCTrC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC GTGGTACTGT AGTAACAGGT 
CGTGTAGAAC GAGGTATTAT CCGTACAGGT GATGAAGTAG AAATCGTCGG TATCAAAGAT 
ACAGCGAAAA CTACTGTAAC GGGTGTTGAA ATOTTCCGTA AATTACTTGA CGAAGGTCGT 
GCAGGTGAAA ACATCGGTGC ATTATTACGT GGTACCAAAC G^AAGAAAT GGAAGGTGGT 
CAAGTATTAG CGAAACCAGG TTCAATCACA GGACAGAC^ ACT^AATC AGAAGTGTAG 
GTATTATCAA AAGATGAAGG TCGTCGTCAT ACTCCA^CT TCAAAGGTTA CCG^CACAA 
TTCTATTTCC GTACAACAGA CGTGACTOGT ACAAXCGAAT TACCAGAAGG OGTGGAAATO 
GTAATGCCAG GCGATAACAT CAAGATGACA GTAAGCTTAA TCCACCCAAT TGCGATGGAT 
CAAGGTTTAG GTTTCGCAAT CCGTGAAGGT GGCCGTACAG TAGGTGCAGG G 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: UNA (genomic) 
(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
AACATGATCA CCGGIGCGGC GCAAATGGAC GGAGOGATTT rC^^C TGCAGCTGAT 
GGCCCTATGC CTCAAACTAG GGAGCATATC TTA^^TG GTCAAGTAGG GGTGGCTGAG 
ATCGTTGTTT TCITAAACAA ACAAGACATG GTAGATGACC AAGAATTGTT AGAACTTGTA 
GAAATGGAAG TGCGCGAATT GTIGAGCGCG TATGAATTTC CTGGCGATGA CACTCCTATC 
GTAQCGGGTT CAGCTTTAAG AGCTTTAGAA GAAGGAAAGG CTGGTAATGT GGGTGAATGG 
<*TGAAAAAG TGCTTAAACT TATGGCTGAA GTGGATGCCT ATATCCCTAC TCGAGAAAGA 
GACACTGAAA AAAC^GTT GATGCCGGTT GAAGATGTGT TCTCTATT6C GGGTAGAGGG 
ACTGTGGTTA CAGGTAGGAT TGAAAGAGGC GTGGTOAAAG TAGGCGATGA AGTGGAAATC 
GTTGGTATCA GACCTACACA AAAAACGACT GTAACCGGTG TAGAAATGTT TAGGAAAGAG 
TTGGAAAAAG GTGAAGCCGG CGATAATGTG GGCGTGCTTT TGAGAGGAAC TAAAAAAGAA 
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GAAGTGGAAC 


rrfiRTATGGT 


TCTATGCAAA CCAGGTTCTA 


TCACTCCGCA 


CAAGAAATTT 


660 


GAGGGAGAAA 


TTTATGTCCT 


TTCTAAAGAA GAAGGCGGGA 


GACACACTCC 


ATTCTTCACC 


720 


AATTACCGCC 


CGCAATTCTA 


TGTGCGCACA ACTGATGTGA 


CTGGCTCTAT 


CACCCTTCCT 


780 


GAAGGCGTAG 


AAATGGTTAT 


GCCTGGCGAT AATGTGAAAA 


TCACTGTAGA 


GTTGATTAGC 


840 


CCTGTTGCGT 


TAGAGTTGGG 


AACTAAATTT GCGATTCGTG 


AAGGCGGTAG 


GACCGTTGGT 


900 












906 



GCTGGT 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus luteus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
AACATGATCA CCGGCGCCGC TCAGATGGAC GGCGCGATCC TCGTGGTCGC CGCTACCGAC 
GGCCCGATGG CCCAGACCCG TGAGCACGTG CTCCTGGCCC GCCAGGTCGG CGTGCCGGCC 
CTGCTCGTGG CCCTGAACAA GTCGGACATG GTGGAGGACG AGGAGCTCCT CGAGCGTGTC 
GAGATGGAGG TCCGGCAGCT GCTGTCCTCC AGGAGCTTCG ACGTCGACGA GGCCCCGGTC 
ATCCGCACCT CCGCTCTGAA GGCCCTCGAG GGCGACCCCC AGTGGGTCAA GTCCGTCGAG 
GACCTCATGG ATGCCGTGGA CGAGTACATC CCGGACCCGG TGCGCGACAA GGACAAGCCG 
TTCCTGATGC CGATCGAGGA CGTCTTCACG ATCACCGGCC GTGGCACCGT GGTGACCGGT 
CGCGCCGAGC GCGGCACCCT GAAGATCAAC TCCGAGGTCG AGATCGTCGG CATCCGCGAC 
GTGCAGAAGA CCACTGTCAC CGGCATCGAG ATGTTCCACA AGCAGCTCGA CGAGGCCTGG 
GCCGGCGAGA ACTGCGGTCT GCTCCTGCGC GGTCTGAAGC GCGACGACGT CGAGCGCCGC 
CAGGTGCTGG TGGAGCCGGG CTCCATCACC CCGCACACCA ACTTCGAGGC GAACGTCTAC 
ATCCTGTCCA AGGACGAGGG TGGGCGTCAC ACCCCGTTCT ACTCGAACTA CCGCGCGCAG 
.XCTACTTCC GCACCACCGA CGTCACCGGC GTCATCACGC TGCCCGAGGG CACCGAGATG 
GTCATGCCCG GCGACACCAC CGAGATGTCG GTCGAGCTCA TCCAGCCGAT CGCCATGGAG 
GAGGGCCTCG GCTTCGCCAT CCGCGAGGGT GGCCGCACCG TGGGCTCCGG C 
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(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE • 

(A) ORGANISM : ^yeotacteriu. tuberculosis 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
AACAXGAXCA CCGGCGCCCC GCAGAXGGAC GGTGCGAXCC XGGXCGXCGC CGCCACCGAC 
-CCOAXGC CCCAGACCCG CGAGCACGTT CTGCTGGCGC GTCAAGXGGG TGTGCCCTAC 
-CXGGXAG CGCXGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACXCGXC 
GAGATGGAGG XCCGCCAGCX GCXGGCXGCC CAGGAATTCG ACGAGGACGC CCCGGTTGTG 
CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT GGGTXGCCXC TGTCGAGGAA 
CTGATGAACG CGGTCGACGA GTCGATTCCG GACCCCCXCC GCGAGACCGA CAACCCGXXC 
CXGAXGCCGG TCGAGGACGT CXXCACCAXX ACCGGCCGCG GAACCGTGGT CACCGGACGX 
GTGGAGCGCG GCGTGATCAA CGTGAACGAG GAAGTTGAGA XCCXCGCCAX TC GCCCA T CG 
ACCACCAAGA CCACCGXCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA CCAGGGCCAG 
GCGGGCGACA ACGTTGGTTT GCXGCXGCGG GGCGTCAAGC GCGAGGACGT CGAGCGTGGC 
-CXXGXCA CCAAGCCCGG CACCACCACG CCGCACACCG AGTTCGAAGG CCAGGTCTAC 
-CXGXCCA AGGACGAGGG CGGCCGGCAC ACGCCGXXCX TCAACAACXA CCGXCCGCAG 
-CXACXXCC GCACCACCGA CGXGACCGCX GTGGTGACAC TGCCGGAGGG CACCGAGATG 
OXGAXGCCCG GXGACAACAC CAACAXCXCG GTGAAGXXGA XCCAGCCCGX CGCCAXGGAC 
CAAGGXCXGC GXXXCGCGAX CCGCGAGGGX GGCCGCACCG XGGGCGCCGG C 
(2) INFORMAXION FOR SEQ ID NO: 1S1 . 

(i) SEQUENCE CHARACXERISXICS : 

(A) LENGTH: 891 base pairs 

(B) xypb: nucleic acid 

(C) SXRANDEDNESS : double 

(D) XOPOLOGY: linear 

(ii) MOLECULE XYPB: DNA (genomic) 
(vi) ORIGINAL SOURCE: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
891 



SUBSTITUTE SHEET (RULE 26) 



PCT/CA97/00829 

WO 98/20157 

- 146 - 

(A) ORGANISM: Mycoplasma 5 enitaliu» 
(Xl) SKWraCE DESCRIPTION SEO IE KO: Ml. 
„CA 0AGGEGO1.C AOAAAXGGAT — ™ TflGTTQTTTC AGCAACTGAT 
^ATGO CCCAAACCCG OGAGOAOATO ™GTTGCOC GGCAAGTAGG 
ATGGTAGTTT TTCTAAACAA GTGTGATATT GCTAGTGATG AAGAGGTACA AGAACTTGTT 
GCTGAAGAAG TACGTGATCT GTTAACTTCC TATGGTTTTO ATGGTAAGAA CACTCCTATT 
ATTTATGGCT CAGCTTTAAA AGCATTGGAA GGTGATCCAA AGTGGGAGGC TAAGATCCAT 
GATTTGATTA AAGCAGTTGA TQAATOGATT CCAACTCCTA CACGTGAAGT AGATAAACCT 
TTCTTATTAG CAATTGAAGA TACGATGACC ATTACTGGTA GAGGTACAGT TGTTACAGGA 
AGAGTTGAAA GAGGTGAACT CAAAGTAGGT CAAGAAGTTG AAATTGTTGG TTTAAAACCA 
ATTAGAAAAG CAGTTGTTAC 1GGAATTGAA ATGTTCAAAA AGGAACTTGA TTCAGCAATG 
gctggtgaca ATGGTGGGOT ATTATTACGT GGTGTTGAAC GTAAAGAAGT TGAAAGAGGT 
CAAGTTTTAG CAAAACCAGG CTCTATTAAA CCGCACAAGA AATTTAAAGC TGAGATCTAT 
GCTTTAAAGA AAGAAGAAGG ^AGAGAG «™n TAAACGGTTA CCGTCCTCAA 
TTCTATTTCC GTACCACTGA TGTAACTGGT TCTATTGCTT TAGCTGAAAA TACTGAAATG 
(JTTCTACCTG GTGATAATGC nGTATTACT GTTGAGTTAA TTGCTCCTAT CGCTTGTGAA 
AAAGGTAGTA AGTTCTCAAT TCGTGAAGGT GGTAGAACTG TAGGGGCAGG G 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: DNA (genomic) 

fv-n ORIGINAL SOURCE: 

(A) ORGANISM: Seis^ria joaorrheae 

SKJOEBCE DBSCRIOTOS, SEQ ID RO, 162: 

aacatcatta gcggcggcgg acaaaeggac ggtggaaecg tggtatg™ tgctggcgac 

GGCCCTATGC CGCAAACCCG CGAAGAGATG OTGOIGOOCO GTCARGTAGG CGTACCITAC 

atcatcgtgt ieatgaagaa atgogaoatg gtcgacgatq oggagoioti gcaactggtt 

QAAATGGAAA TCCGCGACCT GCTGTCCAGC TACGACTTCC CCGGCGACGA CTGCCCGATC 
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AGCG^GAA cTTACGAAGA AAAAA^ 

-CTCC™ CCGCATTGGA CAGATACATC CCGACrCGCG ^ ^ 

CTATCGAAGA CGTG1TCTCC A^CGGCG ^ MCC0T 
GG^AGAGC GAGGTATCAT CCm ca „ TCMAAAGAA 
ACCCAAAAAA GCA OTCT AC CGGCGTTGAA A^CA AACTGCTGGA CGAAGGTCAG 
GCGGGCGACA ACGTAGGCGT A^CGC GG.ACOAAAG G^AA^ 
CAGGTATTGG CCAAACGGGG TACTATCACT CCTCACACCA AGTTCAAAGC AGAAGTGTAC 
GTATTGAGCA AAGAAGAGGG CGGCCCCCAT ACCCCGTTTT TCGCCAACTA CCGTCCCCAA 
—C** GTACCACTGA CGTAACCGGC ACGATTACTT W3GAAAAAGG „ ra 
GTAATGCCGG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT CGCTATGGAA 
GAAGGTCTGC GCTTTGOGAT TCGCGAAGGC GGCCGTACCG „ 
(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(vi) ORIGINAL SOURCE • 

(A) ORGANISM: Rickettsia pro^ekii 
<*i> SEQUENCE DESCRIPTION: SEQ ID NO- 1 S3 . 
AATATGATAA G„C GGTGCTATAT TAGTAGTTTC TGCTGCTOAT 

GGTCCTATGC CTCAAACTAG AGAACATATA TTACIGGCAA AACAGGTAGG ^ m 
A^A, m ^ SGT1Gjrara nmms am AGAATTAGIT 
GAGATGGAAG TAAGAGAATT ATTATCAAAA TATGGTTTCC CTGGTAATGA AA^ 
ATTAAAGGTT C^A AGCTTTAGAA GGAAAACGXG AAGGTGAAAA AGCTATTAAT 

TACGTATATA CCTCAGCCTA TAGAGC1ACA AGA^ 
CAATAGAGGA mcr 

AGAGTGGAGT CAGGCATAAT TAAGGTGGGT GAAGAAATTG AAATAGTAGG TCTAAAAAAT 
~ CGACTTQTAC A^AGAA A^^AGA, AA^ 
^GTGGAGATA ATGTCGGTAT ATTACTACGT GGTACAAAAA GAGAAGAAGT AGAAAGAGGA 
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caaaacctgg gagcataaaa ccgcatgata ***** tgaagtgtat 

GTGCTTAGTA — TGGACGTCAT ACCCCATXTA CTAATGATTA T CGCCCACAG 
TTCTATTTTA GAACAACAGA TGTTACCGGC ACAATAAAAT TGCCTTCTGA TAAGCAGATG 
GTTATGCCTG GAGATAATGC TACTTTTTCA GTAGAATTAA TTAAGCCGAT TGCTATCCAA 
GAAGGGTTAA AATTCTCTAT ACGTGAAGGT GGTAGAACAG TAGGAGCCGG T 
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 164: 
„CA CCGGTGCGC —» ~ — "* TGGTTGTTGC 
GGCCCGATGC CGCAGACCCG ^AGCACATC C~C «— 
ATCATCGTGT ^GAACAA «— ™ 

sagatggaac ™c ™cg «™ 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG «— « 

GCTTCCTGGA TTCTTATATT CCGGAACCAO AGCGTGCGAT TGACAAGCCG 
TTCCTGCTGC CGATCGAAGA CG^CTCG ATCTCCGGTC GTGGTACCGT TGTTACCGGT 
CGTCTAGAGC GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 
ACTCAGAAGT CTACC^AC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 
GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAA * CQAACGTGGT 
CAGQTACTGG CTAAGCCGGG CACCATCAAG CCGCACAGCA AGTTCGAATC TGAAGTGTAC 
ATTCTGTCCA AAGATGAAGG CGGCCOTCA, ACCCGTTCT TCAAAGGCTA CCGTCCGCAG 
TTCTACTTCC GTACTACIGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 
GTAATGCCGG GCGACAACAT CAAAATGGTT GTTAGCGTGA TCCACCCGAT CGCGATGGAC 
GACGGTCTGC GTTTCGCAAT CCGTCAAGGC GGCCGTACCG TTGGCGCGGG C 
(2) INFORMATION FOR SEQ ID NO: 165: 
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(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putida 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165.- 

ATGATCACTG GTGCTGCACA GATGGACGGC GCGATTCTGG T^rnn 

mUTCTSG TAGTCGCTTC AACAGACGGT 

CCAATGCCAC AGACTCGTGA GCACATCCTG CTTTCTCGTC AGGTTGGCGT ACCATTCATC 
~CA TGAACAAATG TGACATGGTA GATGACGAAG AGCTGTTAGA GCTAGTTGAG 
ATGGAAGTGC GTGAACTGTT ATCAGAATAC GATTTCCCAG GTGATGACTT ACCGGTAATC 
CAAGGTTCAG CTCTGAAAGC GCTAGAAGGC GAGCCAGAGT GGGAAGCAAA AATCCTTGAA 
TTAGCAGCGG CGCTGGATTC TTACATTCCA GAACCACAAC GTGACATCGA TAAGCCGTTC 
CTACTGCCAA TCGAAGACGT ATTCTCAATT TCAGGCCGTG GTACAGTAGT AACAGGTCGT 
OTTGAGCGTG GTATTGTACG CGTAGGCGAC GAAGTTGAAA TCGTTGGTGT ACGTGCGACA 
ACTAAGACAA CGTGTACTGG TGTAGAAATG TTCCGTAAAC TGCTTGACGA AGGTCGTGCA 

~ — ATGACGTAGA ACGTGGTCAA 

™ga AGCCAGGTTC AATCAACCCA CACACTACT. ttgaatcaga AGTTTACGTA 
CTGTCAAAAG AAGAAGGTGG TCGTCACACG CCATTCTTCA AAGGCTACCG TCCACAGTTC 

™ta CAACTGACGT AACCGGTACT ATCGAACTGC CAGAAGGCGT AGAGATGGTA 

™ C G ATAACATCAA GATGGTAGTG ACACTGATTT GCCCAATCGC GATGGACGAA 

GGTTTACGCT TCGCAATCCG TGAAGGCGGT CGTACAGTGG T 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Vi) ORIGINAL SOURCE- 

(A) ORGANISM : Stigma tella aurantiaca 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
AACATGATCA CGGGCGCGGC GCAGATGGAC GGAGCGATTC TGGTGGTGTC CGCGGCCGAC 
OOCCCGATGC CCCAGACGCG TGAGCACATC CTGCTGGCCA GGCAGGTGGG CGTGCCCTAC 
ATCGTCGTCT TCCTGAACAA GGTGGACATG CTGGACGATC CGGAGCTGCG CGAGCTGGTG 
GAGATGGAGG TGCGCGACCT GCTCAAGAAG TACGAGTTCC CGGGCGACAG CATCCCCATC 
ATCCCTGGCA GCGCGCTCAA GGCGCTGGAG GGAGACACCA GCGACATCGG CGAGGGAGCG 
TGATGGCGGC GGTGGACGAG TACATCCCGA CGCCGCAGCG TGCGACGGAC 
^GCCGTTCC TGATGCCGGT GGAAGACGTG TTCTCCATCG CAGGCCGAGG AACGGTGGCG 
ACGGGCCGAG TGGAGCGCGG CAAGATCAAG GTGGGCGAGG AAGTGGAGAT CGTGGGGATC 
CGTCCGACGC AGAAGACGGT CATCACGGGG GTGGAGATGT TCCGCAAGCT GCTGGACGAG 
GGCATGGCGG GAGACAACAT CGGAGCGCTG CTGCGAGGCC TGAAGCGCGA GGACCTGGAG 
CGTGGGCAGG TGCTGGCGAA CTGGGGGAGC ATCAACCCGC ACACGAAGTT CAAGGCGCAG 
GTGTACGTGC TGTCGAAGGA AGAGGGAGGG CGGCACACGC CGTTCTTCAA GGGATACCGG 
CCGCAGTTCX ACTTCCGGAC GACGGACGTG ACCGGAACGG TGAAGCTGCC GGACAACGTG 
GAGATGGTGA TGCCGGGAGA CAACATCGCC ATCGAGGTGG AGCTCATTAC TCCGGTCGCC 
ATGGAGAAGG AGCTGCCGTT CGCCATCCGT GAGGGTGGCC GCACGGTGGG CGCCGGC 
(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A ) ORGANISM: Streptococcus pyogenes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
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AACATGATCA 


CTGGTGCCGC 


TCAAATGGAC 


GGAGCTATCC TTGTAGTTGC 


TTCAACTGAT 


60 


GGACCAATGC 


CACAAACTCG 


TGAGCACATC 


CTTCTTTCAC GTCAGGTTGG 


TGTTAAACAC 


120 


CTTATCGTGT 


TCATGAACAA 


AGTTGACCTT 


GTTGATGACG AAGAGTTGCT 


TGAATTAGTT 


180 


GAGATGGAAA 


TTCGTGACCT 


TCTTTCAGAA 


TACGATTTCC CAGGTGATGA 


CCTTCCAGTT 


240 


ATCCAAGGTT 


CAGCTCTTAA 


AGCTCTTGAA 


GGCGACACTA AATTTGAAGA 


CATCATCATG 


300 
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GAATTGATGG ATACTGTTGA TTCATACATT Ctor**n~ 

ATACATT CCAGAACCAG AACGCGACAC TGACAAACCA 
TTGCTTCTTC CAGTCGAAGA CGTATTCTPA 

GTATTCTCA ATTACAGGTC GTGGTACAGT TGCTTCAGGA 

CGTATCCACC GTGGTACTGT TCGTGTCAAC GACGAflATCG AAATCGTTGG TATCAAAGAA 
GAAACTAAAA AAGCTGTTGT ^ GAAATGTTCC GTAAACAACT TGACGAAGGT 
CTTGCAGGAG ACAACGTAGG TATCCTTCTT CCTCGTGTIC AACGTGACGA AATCGAACGT 
GGTCAAGTTA TTGCTAAACC 

TATATCCTTT CTAAAGACGA AGGTGGACGT CACAC T CCA T TCTTCAACAA CTACCGTCCA 
****** TCCGTACAAC TGACGTAACA GGTTCAATCG AACTTCCAGC AGGTACAGAA 
ATGGTTATGC CTGGTGATAA CGTGACAATC 

GACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA 

GAACAAGGTA CTACTTTCTC AATCCGTGAA GGTCC zrr^ 

v-t.it.AA GGTGGACGTA CTGTTGGTTC AGGT 

(2) INFORMATION FOR SEQ ID NO: i 68 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 897 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

Cvi) ORIGINAL SOURCE- 

(A) ORGANISM : Ttl obacil l us cuprinus 

Ui) SEQUENCE DESCRIPTION : SEQ ID NO- 168 - 
AACATGATCA CCGGTGCGGC CCAGATGGAC GGCGCCATCC TGGTCGTGTC CGCCGCCGAC 
OOCCCCATGC CCCAAACCCG CGAGCACATC CTGCTGGCGC GTCAGGTGGG CGTGCCCTAC 
ATCATCGTGT TCCTCAACAA GTGCGACATG GTCGACGACG CCGAGCTGCX CGAACTCGTC 
GAGATGGAAG TGCGCGAGCT GCTGTCCAAG TACGACTTCC CCGGTGACGA CACCCCCATC 
ATCAAGGGCT CGGCCAAGCT GGCCCTCGAA GGCGACAAGG GCGAACTGGG CGAAGGCGCC 
ATTCTCAAGC TGGCCGAGGC CCTGGACACC TACATCCCCA CGCCCGAGCG GGCCGTCGAC 
GOOTCOTTCC TCATGCCCGT GGSMacmv- 

™™ "CTCCATCT CCGGGCGCGG CACGGTGGTC 
ACCGGGCGTG TGGAGC6CGG C« TO GTCGGCGAGG AflATCGAQAT TGTCGGCCTC 
-OCCCGC c _ CCGGc GTGGAAATGT TCAGGAAGG, OC^COG 

«-« GCGACAACGT CGGCATCTTG Cc „ 
—GCAGG TGGTG^AA ACCCGGCTCG A.CAAGGCGC AGAGCGA OT CACCGCCGAG 
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GTCTACGTGC «— « «»» «»» """"" 

CCGCA6TTCT »«<«« ■»»«= — «~ 

TGCCCC^ »»MC *T<™ — °" 

4IOT1MM c gccoccct. cccotccsc «««» — «— « 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: a77irflinl 
(A) ORGANISM: Treponema pallidum 

( Xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 169: 
AATATGATCA CGGGTGCTGC GCAGATGGAC GGTGGTATTC TCGTCGTGTC TGCGCCTGAC 
G „C CACAGACGAA GGAGCATCTT CTGCTCGCCC GTCAGGTTGG TGTTCCCTCC 
ATCATTGTTT TTTTGAACAA GGTTGATTTG GTTGATGATC CTGAGTTGCT AGAGCTGGTG 
GAAGAAGAGG TGCGTGATGC GCTTGCTGGA TATGGGTTTT CGCGTGAGAC GCCTATCGTC 
AAGGGGTCTG CGTTTAAAGC TCTGCAGGAT GGCGCTTCCC CGGAGGATGC AGCTTGTATT 
GAGGAACTGC TTGCGGCCAT GGATTCCTAC TTTGAAGACC CAGTGCGTGA CGACGCAAGA 
CCTTTCTTGC TCTCTATCGA GGATGTGTAC ACTATTTCTG GGCGTGGTAC CGTTGTCACG 
GGGCGCATCG AATGTGGGGT AATTAGTCTG AATGAAGAGG TCGAGATCGT CGGGATTAAG 
CCCACTAAGA AAACAGTGGT TACTGGCATT GAGATGTTTA ATAAGTTGCT TGATCAGGGA 
ATTGCAGGTG ATAACGTGGG GCTGCTTTTG CGCGGGGTGG ATAAAAAAGA GGTTGAGCGC 
GGTCAGGTGC TTTCTAAGCC CGGTTCTATT AAGCCACACA CCAAGTTTGA GGCGCAGATC 
TACGTGCTCT CTAAGGAAGA GGGTGGCCGT CACAGTCCTT TTTTTCAAGG TTATCGTCCG 
CAGTTTTATT TTAGAACTAC TGACATTACC GGTACGATTT CTCTTCCTGA AGGGGTAGAC 
ATGGTGAAGC CGGGGGATAA CACCAAGATT ATAGGTGAGC TCATCCACCC GATAGCTATG 
GACAAGGGTC TGAAGCTTGC GATTCGTGAA GGGGGGCGCA CTATTGCTTC TGGT 
(2) INFORMATION FOR SEQ ID NO: 170: 
(i) SEQUENCE CHARACTERISTICS: 
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A LENGTH: 891 base pairs 
fB) TYP B: nucleic aci l 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Vi) ORIGINAL SOURCE • 

(A) ORGANISM : Vr eaplasma ureaJyticum 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 . 
AATATGATTA CAGGGGCAGC ACAAATGGAT GGAGCAATTT TAGTTATTGC 
GGGGTTATGG CTCAAACTAA AGAACATATT TTATTAGCAC GTCAAGTTGG 
ATCGTTGTTT TCTTAAACAA ATGTGATTTC ATGACAGATC CAGATATGCA 
GAAATGGAAG TTCGTGAATT ATTATCTAAA TATGGATTTG ATGGCGATAA 
ATTCGTGGTT CAGGTCTTAA GGCTTTAGAA GGAGATCCAG TTTGAGAAGC 
GAATTAATGG ACGCAGTTGA TTCATGAATT CCATTACCAG AACGTAGTAC 
TTCTTATTAG CAATTGAAGA TGTATTCACA ATTTCAGGAC GTGGTACAGT 
CGTGTTGAAC GTGGTGTATT AAAAGTTAAT GATGAGGTTG AAATTGTTGG 
ACTCAAAAAA CTGTTGTTAC AGGAATTGAA ATGTTTAGAA AATCATTAGA 
GCTGGTGATA ATGCTGGTAT TTTATTACGT GGTATTAAAA AAGAAGATGT 
CAAGTACTTG TAAAACCAGG ATCAATTAAA CCTCACCGXA CTTTTACTGC 
ATTCTTAAAA AAGAAGAAGG TGGACGTCAT ACACCTATTG TTTCAGGATA 
TTCTATTTTA GAACAACAGA TGTAACAGGT GCTATTTCAT TACCTGCTGG 
GTTATGCCAG GTGATGACGT TGAAATGACT GTAGAATTAA TTGCTCCAGT 
QATGGATCTA AATTCTCAAT CCGTGAAGGT GGTAAAACTG TAGGTCATGG 
(2) INFORMATION FOR SEQ ID NO: l 7i; 

fi) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE - 

(A) ORGANISM: WolineUz succino 9 enes 
C*i> SEQUENCE DESCRIPTION: SEQ ID NO: l 7l; 



TGCATCTGAT 
TGTTCCAAAA 
AGATCTTGTT 
CACACCAGTT 
AAAAATTGAT 
TGACAAACCA 
AGTAACTGGA 
TCTAAAAGAC 
TCAAGCTGAA 
TGAACGTGGT 
TAAAGTTTAT 
CCGTCCACAA 
TGTTGATTTG 
TGCGATTGAA 
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„«. CAGG^C »— — ™ ™" 
GGCCCGATGC CCGAAAC T AG — »» — Cm!CraC 
ATCSTGGTTT — - " 

„AG ^AGAGAAC, TCITAGCAAC T ACGA OT CC C—A «~ 
GTTGCAGGTT CCHCH* — « -~- «— » 

„ga aagta^aa " gagg^gacc T ™g 

ATAAGCCTTT — «—» 
G0 „CC3 TO -.a— «— CGAAGTAGAA 

ATCGTTGOTA TCCGAAACAC ACAAAAAACA ACCSTAACTS «««« 8TTCCGAAAA 
GAGCTCGA3A AGGGTGAGGC GGGTGACAAC GTTGGTGTTC «»— CAGCAAGAAA 
AGAGAGGTAT GGTTCTTTGT AAAATAGGTT C^CACCC TCACACTAAC 
„,G AAGTTTACGT TCTTTCCAAA GAGGAAGGCG GACGACACAC TCCA^C 
AATGGATACC SACCTCAGTT CTATGTTAGA ACACAGAC ^CGG^C TATCTCTCTT 
«G TAGAGATGGT TATGCCTGGT GACAACGTTA AGATCAATGT TGAGCTTATC 

ttcGCGATCC GTGAAGGTGG TCGAACCGTT 
GCTCCTGTAG CCCTCGAAGA GGGAACACGA TTCGCGATC 

GGTGCGGGT 

(2) INFORMATION FOR SEQ ID NO: 172: 

ri) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iXl ^AME/KEV: misc.feature 

K OTHER^ INFORMATION : /note= = inosine" 

(iX> ^AME/KEY: .iscxeature 

13 o°TSI I INFO R MATION:/no t e= = inosine- 

UX) ^AME/KEY: .isc.feature 

S OTHER 1 IHF°R MATI0N: / note= - - - OSine " 
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<*i) SEQUENCE D B SCRIP X i 0N: S£Q ^ ^ _ 
TARTCNGTRA ANGCYTCNAC rcacat 
<2J INFOR ^TION FOR SEQ 1D N0; . 

(i) SEQUENCE CHARACTERISTICS 

TYPE: nucleic acid 
(C) STRAND BDNESS • 8in „j. 
») TOPOLOGY: lineal * 

Ui) M ° LECULE «»: «A (genomic) 

<*i> SEQUENCE DESCRIPTION: SEQ ID NO- 173 . 

TCTTTAGCAG AACAGGATGA A 

W INFORMATION FOR SEQ I D N0: „ 4| 

(i) SEQUENCE CHARACTERISTICS- 
A LENGTH: 20 base ^ 
B TYPE: nucleic acid 
'S TRANDEDNESS : single 
TOPOLOGY., linear 

«i) MOLECULE TYPE: DNA ( 9enoraic , 

(Xi> SBQDENCE ^RIPTION: SEQ ID N0 . 
GAATAATTCC ATATCCTCCG 
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CLA1MS 

What is claimed is: 

, a method using probes and/or amplification primers which are specific, 

ki w« bla b/aZ aadB, aacC1, aacC2, aacC3, aac6-//a, aacM, 
of b/a^ Wa^ Wa* Wa - D ' az ' ' f ermA efm B, ermC, 

aa« van*. vanB, vanC, msrA, satA, aac(6)-aph(2 ), vat, g 

mecA, int and sul, and 

o, Entemcoccus feodum, U— ^^LaT Can«a afbr'cans, 

species and Candida species, 

wh e rei neac h o,sa M nuc«e a c W orv a „an,orpan t he re o f co m p rt sesasa,ec t ed 
" .arget region hybridizable with said probes or primers; 

prob es or phmers and date*, ft. presence an^ --^n, o. said specific 
Q mniified Droducts as an indication of the presence anu, 
,„ rnaSor.ongarspec.sandba.eriaian.ibio.icresisranoegenes. 

and/or amount o. nucleic acids from an, bactenum or fungus. 
3 The mefhod of claim 1, which is performed direcff, from a .es, sample. 
25 4 The me,hod o, ^ ,, whfcn is performed dire* from a ,es« sample 
consisting of a bacterial and/or fungal culture or suspense 
. The method o, Calm ,, wherein sa« nucleic aoida are at, detected under 
uniform hybridization or amplification conditions. 

6 . T ha method o, Calm 1 . wherein sa* nucleic acids are amplifed b, a method 
30 selected from the group consisting of: 

a) polymerase chain reaction (PCR), 

b) ligase chain reaction (LCR), 

c) nucleic acid sequence-based amplification (NASBA), 
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d) self-sustained sequence replication (3SR), 

e) strand displacement amplification (SDA), 

0 branched DNA signal amplification (bDNA), 
5 9) ^"^on-mediated amplification (TMA), 

h) cycling probe technology {C PT), 
0 nested PCR, and 
j) multiplex PCR. 

r. *s a ,„ ucteicacidsareampliBedbypcR 

acds by p erformlng for ea^nl!?™"*" °' ,he ■"«— * — 
•» specfflca,,, allowad ,o a„ elongaln X * ""^ a "> 

« ^-«^^^^JS^-** ~„on of . 

a.e StS a mpte orfrom bacteHa, a„d,o r ^ " ^ *c„y , rom 
20 ste Ps: 9 Cu " ure * ™ h, ch comprises the following 

microorgantern^Wof "t^ZZ^" T" 1 teavin 9 m ***» (ha said 
•* microorganism isolated from 1^^"^ of 

wculated sampla or said Isolated raicZl SUPP ° rt - and lysin 9 «*< "Id 
DNA Wad mrcroorgamsm ,„ re , ease lne said mjOTOISanism 

9«H.p consist ofSEQ ID NOa- 26 27 ^ TTf SK " ,e " Ce " ■**«• «■ 
32 to 38. 120 to 124. a saquance oo m „, . ' " 1 10 134 ' «• »« to 143 

'2 ~as in teng^ ~^^^«-of^a,^ 
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ninaitiriis Staphylococcus saprophytics, Streptococcus 
nf ..id hvbridization complex on said inert support or 

in said test sample. 

,„ h ,„, detecting the presence and/or amount of a m,croorgan,s m 
10 . A method for detecting in P Ljsterja moTOCy togene S , 



steps: naqueoussolulionconta ,„ ing at,eas,onepa„o f 
a) Heating sa,d saraple 0 „ e of said pdmets 

oiigonucleotide primers hav,ag at least 12^ nucleoM 5(rands o( 

be^capab.eofhybraiz.ngsetecWywrt.oneo Ihe^o P - ^ 

^microorganism DNA that confab W£ * a$ , Q form an 

pr „ne,s b eing cap* a, - sa|d atteaston e 

3 extension produclwhtch ^ sequence „„„,„ me group consisting 

p* ofprfcners ^ * t0 A 31. 140 .0 143, 32 .0 36. 120 .0 

JT-S£» - - — n,sro ' a " romp 

thereof and a variant thereof, 

to a detectable level; and anrAaa 

30 sample. qFQ , D N qs; 

1 end 2. 3 and 4, 5 and 6, 7 and 8. 9 ad ia s , aedum . um. 

20 , 21 and 22, ^*J^^^aM««" 

35 ^r^rrrt~a ^ — — 

C^s species and S^ococcus specs. 
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12. Amethod for detecting the presence anrf/n, 
5 from Ibis sample, or "homogeneous population of bacteria isolated 

b~iteui:t^::::tc;:* homosene ° us - 

; idb ; Cte "' a '^^-i„,su b s,a„^ s ,„ g , slrandedform . 

aroup consist 0 , SEQ .0 NOs: ,„ ^ ~f;«^c° is selected *„ m „ 
"hereof, a part thereof having at feast n J*Z* ' " Se,Uence ""plemenlary 
which specify and ubiguitously JllT"' *! "I ten3 ' h ' a "« * -« 
Mortal speofes. under conditions such Lm, °' ""—•*« of any 
hybrtdfce m saa ^0^^*^ * « 
fomed; and ONA ' wneret >y a hybridization complex is 

2 « said test sample. " """o presence and/or amount of any bactertum in 

'3- Am o<nodfordetectingthepresenc»=,„* 

sample which comprises the Wolg steps am0Un ' * "* '"**»> »> a fast 
a) treating said sample with an 

^^'^^oomainsaC^^^^'^'ys.rt.nd,,,, 
capabhof h*,*^^ other^ZS being 
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14 

23 and 24 

15. 



The method of claim 1 3, wherein said pair of primers is defined in SEQ ID NOs: 



„ A memod .or chaining rrrf sequences from any bacterium direcfl, from a fes, 
sampte or a bactena. cuHure. which comprises fhe following sfeps: 

a) .reating said sample w«h an aqueous soiution containing a p*tf pnmers 
• eniipnce selected within the nucleotide sequences defined in SEQ ID NOs. 
having a sequence selected » nucteolides in , e ngth, a sequence 

107 and ,08. a pad .hereof ha»g * 12 ^ - 

complementary thereof, and a variant thereof o ^ 
hytdd^g sefcCively - *J^°— ^ l re ^ oapabia o, 
'° r=ant stas ,o rarm an .tension prod. wh*h 

contains the target sequence as a template, 

15 to a detectable level; and 

c) de.eCing.he presence ^oramoun, of said ampl^farge. sequence- and 

« denning .be nucieotide se q uence o, ft. said amp.ifled .arge, sequence 
by using any DNA sequencing method. 

sample, or 

or isolated fungi to release the fungal DNA. 

said fungal DNA being made in a substanUally singte-sUanded form; 

teas.one sing.e-s.randed nucleoMe "•""-^"E, hereo f hav.ng 

whereby a hybridization complex is formed; and 

c) dete*ingthepresenceof^^^ 
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in said solution as an indication *h 

-estsampte. ^ «» pre — -d/ora m „u„, o, any (ungus ,„ MW 

17- A method for detecting the 

sample which comprises the fcfcj^ "** am0l,n ' ° f an >- »■»• * . test 

primers 

any langa, DNA ,ha, contains a ^ Zl**!. ""^"tary strands of 

^ g chosen from . nucleo| ^ " ^ Pair °' >*«• 

120 to 124. a sequence ' ^ ~9 - SEQ ID NOs: 

* inereof ' and a variant thereof- 

b) synthesizing an extension product of ea rh.f ■„ 
product containing the target sequence and a l n rl """^ ^ eXtensio " 

'5 to a detectable level; and mP ' ,fy,n9 Sa,d far 9 e * sequence, if any, 

W- A method for obtaining fu/ sequences from . 
eempteora^a,^, wWch c^^SZ^ '» a - 

'"9 -d ,72. a part thereof having a " e Ttt ^ SEQ ID N °* 

complementary thereof, and a variant the™, o n eT"f " * S ^ 

hybridizing select^ M 0 „ e of the tw^mll , bei " 3 " P- * * 

genethatcontainsa^rgetsequence ""? ,a,y s,ra " d s of said fungal W 

nybridizing with ,„ e olher oinZtTlT, *** P ' imerS "*« «■"* °' 
contains the target sequence as a template; < " tenSi ° n Prodl,ct "*** 

b) synthesizing an extension oroduoi t . 
P-oduc, containing me targetsequence andalo 1 " """^ «* «*»*• 
to a detectable level; and ' "^"""'^aaid targe, sequence, if any. 

<0 determ.n.ng the nucleotide sequence of h» e „ 
by using any DNA sequencing method arnpMed larget 
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• nf h/a WaZ aac6'-//a, erwA, ermB, ermC, vanB, vanC 
is 5e ,ec.ed from the group coasts M of SEQ » »M* spe** 

ssr; ==■— -7^^- 

nw nf a hybridization complex as an inan.o 
bacterial re*tence medtated by sa,d lhBCTalliatto „„ f the presence 

o( a bacteria, — «* -C. «* - C ' 

, ro m me group «*W * " ^ which ^ .He Wowing steps: 
20 directly from a test sample or a o „,„ iM at least one pair of 

p*n» Having a, teas, 12 nudeoM- *"* - - 

antibionc resistance gene ^«*»' 9 o| ^ strand5 so as to torn, an 
25 primers being cepacia of M^^-*, as . opiate, sard at .east one 

0( SEQ ID NOs: 110, 111. «*. "^sequence compte men.ary thereof, and a 
said bacterial antibiotic resrstance gene, a sequen 

30 variant thereof. „, ea ch at said primers, sard extension 

product containing the targei &c M 

to a detectable level; and sequence as 

^ resistance genes. s0B ST»TUTE SHEET (RULE 26) 
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*<W UaZ, aadB, aaccr aacc? »Z 9 ™ p consis »"9 of bla Wa „ 

a) bating said sampfe an "» **>«*>9 teps: 

41 to 44. 45 to 48, 49 and 60 51 „„„ * c0 ™*>9 of SEQ | D NOs . „ ,„ '„ 

« to 68, 57 and 88, 89 and 90. 9, " 'J 75 « 77 ,„ 80. 8, and 8^' 

^quence as a template; eXtension ^ which confains ,he 2,* 

to a detectable tevel; and ' amp " fy ' n 9 «* target sequence, if any " 

c) detecting the presence anw/„ 
« o,a bactedai ^ r ;"°: , ; , r W ^ 1 ar 9 e tS e qU e„ ceas 

a " re — - by one of said baoteda, an0 ^ 

W which, when m sin 9 ,e-s, ra „; ed rr:;:7 teme "^ *~* « ™Ham 
•* = targe, bacteda, or to„g a , 0NA as - 

» ."zrzrrr 

a-ordi„ 9 ,o ciai m 24 . *M«d by . recorab|r , ant ^ 

26 - A recombinant host according to claim « . 



20 



PCT/CA97/00829 - 

WO 98/20157 - 164- 

. I cies and Cand/da species. « " consfeli „ g of SEQ ID NOs: 26 

5 -«^^^^^«-^~^ - 



10 



15 



wi 

valiants thereof. ..^ification of the nucleic acids of any 

of Enferococcus faecmm, ™ a aga/acSae, Candida aldrcans, 

s'apMococo- ' „,,, spede s, Sfrepfococors 

W 1— NeiSSe " a a XU combination of primers of * 

fpldes and Candida spe*s. consl s,in 9 of SEQ ID NOst 26 

variants thereof. „„ a „Mcalion of the nucleic acids of any 

29 Adiagnosf^ifforuredefectiona^or^^h romthegrw consist 

0 , entero coocus faaciom. bste ™ ™ „ s aga , ac Baa, Candida a/Prcana, 
S^coccrrs .•«*•»* ^SloLspedeea"-^^ 

5 tangur.seouencescomplententa^hereo.. nucletoa( *, 5 ofany 

l^-^^r^jT^ vane, coning « 

30 consisting of SEQ iu 

variants thereof. h ucteic acids of any 

!Ul«-*^ , ^^?-e comprising any Saitoh* 

M c „ m bina.ion o, ipnmers £ -11 ^ hereof, 
consisting of SEQ »■> 
variants thereof. 
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32 A diagnostic kit for the cfawr 
,so - A diagnostic to f,wi._ . . 



35 a j. vanants thereof. 

36 - A d,agnos« c kit, as defined m h • V3nantS therec * 

A diagnost/c kit w 
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40. A ^ a 9 n0St,C " rt ofatleast1 2 nucleotides « length select ^ 

• - — — 

M-> " om,he ana variants the-eof. ' ^ , esislanc e 



saqU ence from «* 9™ ■ - ^ ^* w ^ ^ 

sequencas "mP^^ ^ nudete add s of any M*W> a * aadB , 

a^/or ^ ^ t«sW o. »- * - * ^ van*. 
g e„a seized from*» 9™£ ^ amA «A 

aacCI. aac02. aatC3, aacA4 a ^ and ,„,. 

43 . A diagnose k * * 12 nudeotid es in length select s 

=r-= — 

M . _ _ j^rm^ri in claim ^» TUl .u.«ho <;eai # 



lYteast 12 nuclaotidas m leng* ***** complement 
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^ fa „ ybacleriumand/ 

46 a -i. 



46 - A diagnostic kit as rfo«„ , . 
se 1"ence from „ r ,2nu< **tetole ( ia|i.<=i 7 9 any suila l>fe 
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